JP7631308B2 - エンコーディング及びデコーディングテーブルを用いたセミソーティング圧縮 - Google Patents

エンコーディング及びデコーディングテーブルを用いたセミソーティング圧縮 Download PDF

Info

Publication number
JP7631308B2
JP7631308B2 JP2022508996A JP2022508996A JP7631308B2 JP 7631308 B2 JP7631308 B2 JP 7631308B2 JP 2022508996 A JP2022508996 A JP 2022508996A JP 2022508996 A JP2022508996 A JP 2022508996A JP 7631308 B2 JP7631308 B2 JP 7631308B2
Authority
JP
Japan
Prior art keywords
prefix
data
encoding
prefixes
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2022508996A
Other languages
English (en)
Japanese (ja)
Other versions
JP2022545644A5 (enExample
JP2022545644A (ja
Inventor
ディー. ブレスロウ アレクサンダー
ジャヤセーナ ヌワン
カラマティアノス ジョン
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced Micro Devices Inc
Original Assignee
Advanced Micro Devices Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Advanced Micro Devices Inc filed Critical Advanced Micro Devices Inc
Publication of JP2022545644A publication Critical patent/JP2022545644A/ja
Publication of JP2022545644A5 publication Critical patent/JP2022545644A5/ja
Application granted granted Critical
Publication of JP7631308B2 publication Critical patent/JP7631308B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0638Organizing or formatting or addressing of data
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/74Selecting or encoding within a word the position of one or more bits having a specified value, e.g. most or least significant one or zero detection, priority encoders
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0602Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/0608Saving storage space on storage systems
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0655Vertical data movement, i.e. input-output transfer; data movement between one or more hosts and one or more storage devices
    • G06F3/0661Format or protocol conversion arrangements
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0673Single storage device
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0673Single storage device
    • G06F3/0679Non-volatile semiconductor memory device, e.g. flash memory, one time programmable memory [OTP]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/06Arrangements for sorting, selecting, merging, or comparing data on individual record carriers
    • G06F7/08Sorting, i.e. grouping record carriers in numerical or other ordered sequence according to the classification of at least some of the information they carry
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/40Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
    • H03M7/4031Fixed length to variable length coding
    • H03M7/4037Prefix coding
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/40Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
    • H03M7/4031Fixed length to variable length coding
    • H03M7/4037Prefix coding
    • H03M7/4043Adaptive prefix coding
    • H03M7/4056Coding table selection
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/40Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
    • H03M7/4031Fixed length to variable length coding
    • H03M7/4037Prefix coding
    • H03M7/4043Adaptive prefix coding
    • H03M7/4062Coding table adaptation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
JP2022508996A 2019-08-16 2020-08-12 エンコーディング及びデコーディングテーブルを用いたセミソーティング圧縮 Active JP7631308B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16/542,872 US11309911B2 (en) 2019-08-16 2019-08-16 Semi-sorting compression with encoding and decoding tables
US16/542,872 2019-08-16
PCT/US2020/045903 WO2021034565A1 (en) 2019-08-16 2020-08-12 Semi-sorting compression with encoding and decoding tables

Publications (3)

Publication Number Publication Date
JP2022545644A JP2022545644A (ja) 2022-10-28
JP2022545644A5 JP2022545644A5 (enExample) 2023-08-17
JP7631308B2 true JP7631308B2 (ja) 2025-02-18

Family

ID=74567451

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2022508996A Active JP7631308B2 (ja) 2019-08-16 2020-08-12 エンコーディング及びデコーディングテーブルを用いたセミソーティング圧縮

Country Status (6)

Country Link
US (2) US11309911B2 (enExample)
EP (1) EP4014128A4 (enExample)
JP (1) JP7631308B2 (enExample)
KR (1) KR102824624B1 (enExample)
CN (1) CN114258521B (enExample)
WO (1) WO2021034565A1 (enExample)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11309911B2 (en) * 2019-08-16 2022-04-19 Advanced Micro Devices, Inc. Semi-sorting compression with encoding and decoding tables
US20240185021A1 (en) * 2022-12-01 2024-06-06 Western Digital Technologies, Inc. Pre-encoding method for dna storage

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110158323A1 (en) 2008-09-12 2011-06-30 Qu Qing Chen Method for lossless compressing prefix-suffix-codes, method for decompressing a bit sequence representing integers or symbols encoded in compressed prefix-suffix-codes and storage medium or signal carrying compressed prefix-suffix-codes

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5442630A (en) * 1991-05-24 1995-08-15 Gagliardi; Ugo O. ISDN interfacing of local area networks
US5704060A (en) 1995-05-22 1997-12-30 Del Monte; Michael G. Text storage and retrieval system and method
US6489902B2 (en) 1997-12-02 2002-12-03 Hughes Electronics Corporation Data compression for use with a communications channel
US6121903A (en) 1998-01-27 2000-09-19 Infit Communications Ltd. On-the-fly data re-compression
US6650259B1 (en) 2002-05-06 2003-11-18 Unisys Corporation Character table implemented data decompression method and apparatus
US8238923B2 (en) 2004-12-22 2012-08-07 Qualcomm Incorporated Method of using shared resources in a communication system
WO2007108395A1 (ja) * 2006-03-23 2007-09-27 Nec Corporation 可変長符号の復号装置および復号方法
US8458457B2 (en) * 2007-02-02 2013-06-04 Red Hat, Inc. Method and system for certificate revocation list pre-compression encoding
US20080317364A1 (en) * 2007-06-25 2008-12-25 Augusta Technology, Inc. Methods for determining neighboring locations for partitions of a video stream
US7609179B2 (en) 2008-01-08 2009-10-27 International Business Machines Corporation Method for compressed data with reduced dictionary sizes by coding value prefixes
TW201141081A (en) * 2009-12-29 2011-11-16 Ibm Prefix-offset encoding method for data compression
US20120110025A1 (en) 2010-10-28 2012-05-03 Qualcomm Incorporated Coding order-independent collections of words
US8606772B1 (en) 2011-01-26 2013-12-10 Trend Micro Incorporated Efficient multiple-keyword match technique with large dictionaries
KR101672107B1 (ko) 2011-11-08 2016-11-02 구글 테크놀로지 홀딩스 엘엘씨 변환 계수들에 대한 이진 코드워드들을 결정하는 방법
US10235377B2 (en) 2013-12-23 2019-03-19 Sap Se Adaptive dictionary compression/decompression for column-store databases
GB2531005A (en) * 2014-10-06 2016-04-13 Canon Kk Improved encoding process using a palette mode
US9647684B2 (en) 2014-10-21 2017-05-09 Huawei Technologies Co., Ltd. Memory-based history search
US10191682B2 (en) * 2016-09-08 2019-01-29 Qualcomm Incorporated Providing efficient lossless compression for small data blocks in processor-based systems
US10613791B2 (en) 2017-06-12 2020-04-07 Pure Storage, Inc. Portable snapshot replication between storage systems
US10838961B2 (en) * 2017-09-29 2020-11-17 Oracle International Corporation Prefix compression
US11309911B2 (en) * 2019-08-16 2022-04-19 Advanced Micro Devices, Inc. Semi-sorting compression with encoding and decoding tables

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110158323A1 (en) 2008-09-12 2011-06-30 Qu Qing Chen Method for lossless compressing prefix-suffix-codes, method for decompressing a bit sequence representing integers or symbols encoded in compressed prefix-suffix-codes and storage medium or signal carrying compressed prefix-suffix-codes
JP2012502573A (ja) 2008-09-12 2012-01-26 トムソン ライセンシング プリフィックス‐サフィックス符号をロスレス圧縮する方法、圧縮されたプリフィックス‐サフィックス符号内に符号化された整数またはシンボルを表すビットシーケンスを圧縮解除する方法、および、圧縮されたプリフィックス‐サフィックス符号を担持する記憶媒体または信号

Also Published As

Publication number Publication date
US11736119B2 (en) 2023-08-22
CN114258521B (zh) 2025-09-02
WO2021034565A1 (en) 2021-02-25
US11309911B2 (en) 2022-04-19
EP4014128A1 (en) 2022-06-22
KR20220049540A (ko) 2022-04-21
CN114258521A (zh) 2022-03-29
JP2022545644A (ja) 2022-10-28
EP4014128A4 (en) 2023-08-09
KR102824624B1 (ko) 2025-06-24
US20220239315A1 (en) 2022-07-28
US20210050864A1 (en) 2021-02-18

Similar Documents

Publication Publication Date Title
US11669521B2 (en) Accelerated filtering, grouping and aggregation in a database system
US11762828B2 (en) Cuckoo filters and cuckoo hash tables with biasing, compression, and decoupled logical sparsity
US20130141259A1 (en) Method and system for data compression
JP2014534486A (ja) スケーラブル・データ・デュプリケーションのための方法、システム、およびコンピュータ・プログラム
JP7758686B2 (ja) 基本データシーブを使用して無損失削減されたデータの効率的な取出しのための基本データの局所性の利用
RU2633178C2 (ru) Способ и система базы данных для индексирования ссылок на документы базы данных
Belazzougui et al. Cache-oblivious peeling of random hypergraphs
JP7631308B2 (ja) エンコーディング及びデコーディングテーブルを用いたセミソーティング圧縮
CN110059129A (zh) 数据存储方法、装置及电子设备
CN107852173B (zh) 对无损简化的数据执行搜索和取回的方法以及装置
US11139828B2 (en) Memory compression method and apparatus
JP2009512099A (ja) トライでの再始動可能なハッシュの方法及び装置
JP2023503034A (ja) パターンベースのキャッシュブロック圧縮
Weißenberger et al. Massively parallel inverse block-sorting transforms for bzip2 decompression on GPUs
US10749545B1 (en) Compressing tags in software and hardware semi-sorted caches
Bertola et al. A Class of Heuristics for Reducing the Number of BWT-Runs in the String Ordering Problem
Xiao et al. Highly efficient string similarity search and join over compressed indexes
Díaz-Domínguez et al. Algorithms for Computing Very Large BWTs: a Short Survey
Waidyasooriya et al. An fpga architecture for text search using a wavelet-tree-based succinct-data-structure
Liu et al. An Efficient Direct Access Algorithm for Integer Compression
JP5419069B2 (ja) データベース装置、データベースの管理方法、データベースのデータ構造、データベースの管理プログラムおよびそれを記録したコンピュータ読み取り可能な記録媒体
Pavlo Lecture# 03: Data Formats & Encoding II
Priyadarshini et al. A modified and memory saving approach to B+ tree index for search of an image database based on chain codes
Zhang Efficient Parallel Text Compression on GPUs

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20220419

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20230808

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20230808

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20240626

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20240709

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20241001

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20250114

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20250205

R150 Certificate of patent or registration of utility model

Ref document number: 7631308

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150