KR102824624B1 - 인코딩 및 디코딩 테이블을 이용한 세미 소팅 압축 - Google Patents

인코딩 및 디코딩 테이블을 이용한 세미 소팅 압축 Download PDF

Info

Publication number
KR102824624B1
KR102824624B1 KR1020227007999A KR20227007999A KR102824624B1 KR 102824624 B1 KR102824624 B1 KR 102824624B1 KR 1020227007999 A KR1020227007999 A KR 1020227007999A KR 20227007999 A KR20227007999 A KR 20227007999A KR 102824624 B1 KR102824624 B1 KR 102824624B1
Authority
KR
South Korea
Prior art keywords
prefix
data
prefixes
delete delete
data items
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
KR1020227007999A
Other languages
English (en)
Korean (ko)
Other versions
KR20220049540A (ko
Inventor
알렉산더 디. 브리슬로우
누완 자야세나
존 칼라마티아노스
Original Assignee
어드밴스드 마이크로 디바이시즈, 인코포레이티드
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 어드밴스드 마이크로 디바이시즈, 인코포레이티드 filed Critical 어드밴스드 마이크로 디바이시즈, 인코포레이티드
Publication of KR20220049540A publication Critical patent/KR20220049540A/ko
Application granted granted Critical
Publication of KR102824624B1 publication Critical patent/KR102824624B1/ko
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0638Organizing or formatting or addressing of data
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/74Selecting or encoding within a word the position of one or more bits having a specified value, e.g. most or least significant one or zero detection, priority encoders
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0602Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/0608Saving storage space on storage systems
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0655Vertical data movement, i.e. input-output transfer; data movement between one or more hosts and one or more storage devices
    • G06F3/0661Format or protocol conversion arrangements
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0673Single storage device
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0673Single storage device
    • G06F3/0679Non-volatile semiconductor memory device, e.g. flash memory, one time programmable memory [OTP]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/06Arrangements for sorting, selecting, merging, or comparing data on individual record carriers
    • G06F7/08Sorting, i.e. grouping record carriers in numerical or other ordered sequence according to the classification of at least some of the information they carry
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/40Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
    • H03M7/4031Fixed length to variable length coding
    • H03M7/4037Prefix coding
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/40Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
    • H03M7/4031Fixed length to variable length coding
    • H03M7/4037Prefix coding
    • H03M7/4043Adaptive prefix coding
    • H03M7/4056Coding table selection
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/40Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
    • H03M7/4031Fixed length to variable length coding
    • H03M7/4037Prefix coding
    • H03M7/4043Adaptive prefix coding
    • H03M7/4062Coding table adaptation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
KR1020227007999A 2019-08-16 2020-08-12 인코딩 및 디코딩 테이블을 이용한 세미 소팅 압축 Active KR102824624B1 (ko)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16/542,872 US11309911B2 (en) 2019-08-16 2019-08-16 Semi-sorting compression with encoding and decoding tables
US16/542,872 2019-08-16
PCT/US2020/045903 WO2021034565A1 (en) 2019-08-16 2020-08-12 Semi-sorting compression with encoding and decoding tables

Publications (2)

Publication Number Publication Date
KR20220049540A KR20220049540A (ko) 2022-04-21
KR102824624B1 true KR102824624B1 (ko) 2025-06-24

Family

ID=74567451

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020227007999A Active KR102824624B1 (ko) 2019-08-16 2020-08-12 인코딩 및 디코딩 테이블을 이용한 세미 소팅 압축

Country Status (6)

Country Link
US (2) US11309911B2 (enExample)
EP (1) EP4014128A4 (enExample)
JP (1) JP7631308B2 (enExample)
KR (1) KR102824624B1 (enExample)
CN (1) CN114258521B (enExample)
WO (1) WO2021034565A1 (enExample)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11309911B2 (en) * 2019-08-16 2022-04-19 Advanced Micro Devices, Inc. Semi-sorting compression with encoding and decoding tables
US20240185021A1 (en) * 2022-12-01 2024-06-06 Western Digital Technologies, Inc. Pre-encoding method for dna storage

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5442630A (en) * 1991-05-24 1995-08-15 Gagliardi; Ugo O. ISDN interfacing of local area networks
US5704060A (en) 1995-05-22 1997-12-30 Del Monte; Michael G. Text storage and retrieval system and method
US6489902B2 (en) 1997-12-02 2002-12-03 Hughes Electronics Corporation Data compression for use with a communications channel
US6121903A (en) 1998-01-27 2000-09-19 Infit Communications Ltd. On-the-fly data re-compression
US6650259B1 (en) 2002-05-06 2003-11-18 Unisys Corporation Character table implemented data decompression method and apparatus
US8238923B2 (en) 2004-12-22 2012-08-07 Qualcomm Incorporated Method of using shared resources in a communication system
WO2007108395A1 (ja) * 2006-03-23 2007-09-27 Nec Corporation 可変長符号の復号装置および復号方法
US8458457B2 (en) * 2007-02-02 2013-06-04 Red Hat, Inc. Method and system for certificate revocation list pre-compression encoding
US20080317364A1 (en) * 2007-06-25 2008-12-25 Augusta Technology, Inc. Methods for determining neighboring locations for partitions of a video stream
US7609179B2 (en) 2008-01-08 2009-10-27 International Business Machines Corporation Method for compressed data with reduced dictionary sizes by coding value prefixes
EP2164176A1 (en) * 2008-09-12 2010-03-17 Thomson Licensing Method for lossless compressing prefix-suffix-codes, method for decompressing a bit sequence representing integers or symbols encoded in compressed prefix-suffix-codes and storage medium or signal carrying compressed prefix-suffix-codes
TW201141081A (en) * 2009-12-29 2011-11-16 Ibm Prefix-offset encoding method for data compression
US20120110025A1 (en) 2010-10-28 2012-05-03 Qualcomm Incorporated Coding order-independent collections of words
US8606772B1 (en) 2011-01-26 2013-12-10 Trend Micro Incorporated Efficient multiple-keyword match technique with large dictionaries
KR101672107B1 (ko) 2011-11-08 2016-11-02 구글 테크놀로지 홀딩스 엘엘씨 변환 계수들에 대한 이진 코드워드들을 결정하는 방법
US10235377B2 (en) 2013-12-23 2019-03-19 Sap Se Adaptive dictionary compression/decompression for column-store databases
GB2531005A (en) * 2014-10-06 2016-04-13 Canon Kk Improved encoding process using a palette mode
US9647684B2 (en) 2014-10-21 2017-05-09 Huawei Technologies Co., Ltd. Memory-based history search
US10191682B2 (en) * 2016-09-08 2019-01-29 Qualcomm Incorporated Providing efficient lossless compression for small data blocks in processor-based systems
US10613791B2 (en) 2017-06-12 2020-04-07 Pure Storage, Inc. Portable snapshot replication between storage systems
US10838961B2 (en) * 2017-09-29 2020-11-17 Oracle International Corporation Prefix compression
US11309911B2 (en) * 2019-08-16 2022-04-19 Advanced Micro Devices, Inc. Semi-sorting compression with encoding and decoding tables

Also Published As

Publication number Publication date
JP7631308B2 (ja) 2025-02-18
US11736119B2 (en) 2023-08-22
CN114258521B (zh) 2025-09-02
WO2021034565A1 (en) 2021-02-25
US11309911B2 (en) 2022-04-19
EP4014128A1 (en) 2022-06-22
KR20220049540A (ko) 2022-04-21
CN114258521A (zh) 2022-03-29
JP2022545644A (ja) 2022-10-28
EP4014128A4 (en) 2023-08-09
US20220239315A1 (en) 2022-07-28
US20210050864A1 (en) 2021-02-18

Similar Documents

Publication Publication Date Title
Pibiri et al. Techniques for inverted index compression
US20200387488A1 (en) Space efficient vector for columnar data storage
US11762828B2 (en) Cuckoo filters and cuckoo hash tables with biasing, compression, and decoupled logical sparsity
CN101937448B (zh) 用于主存储器列存储装置的基于字典的保持顺序的串压缩
US11669521B2 (en) Accelerated filtering, grouping and aggregation in a database system
JP2014534486A (ja) スケーラブル・データ・デュプリケーションのための方法、システム、およびコンピュータ・プログラム
RU2633178C2 (ru) Способ и система базы данных для индексирования ссылок на документы базы данных
Klein et al. Parallel lempel ziv coding
CN103914483B (zh) 文件存储方法、装置及文件读取方法、装置
Qiao et al. An FPGA-based BWT accelerator for Bzip2 data compression
KR102824624B1 (ko) 인코딩 및 디코딩 테이블을 이용한 세미 소팅 압축
Jansson et al. CRAM: Compressed random access memory
US12001339B2 (en) Dictionary compression device and memory system
US10749545B1 (en) Compressing tags in software and hardware semi-sorted caches
Bertola et al. A Class of Heuristics for Reducing the Number of BWT-Runs in the String Ordering Problem
Culpepper et al. Revisiting bounded context block‐sorting transformations
CN115495479A (zh) 字段可变的数据查询方法和装置
US12489461B2 (en) Compression device and compression method
Díaz-Domínguez et al. Algorithms for Computing Very Large BWTs: a Short Survey
US12298952B1 (en) Multiple pass sort with subset splitting
Liu et al. An Efficient Direct Access Algorithm for Integer Compression
JP5419069B2 (ja) データベース装置、データベースの管理方法、データベースのデータ構造、データベースの管理プログラムおよびそれを記録したコンピュータ読み取り可能な記録媒体
Pavlo Lecture# 03: Data Formats & Encoding II

Legal Events

Date Code Title Description
PA0105 International application

Patent event date: 20220310

Patent event code: PA01051R01D

Comment text: International Patent Application

PG1501 Laying open of application
E902 Notification of reason for refusal
PE0902 Notice of grounds for rejection

Comment text: Notification of reason for refusal

Patent event date: 20240719

Patent event code: PE09021S01D

E701 Decision to grant or registration of patent right
PE0701 Decision of registration

Patent event code: PE07011S01D

Comment text: Decision to Grant Registration

Patent event date: 20250412

GRNT Written decision to grant
PR0701 Registration of establishment

Comment text: Registration of Establishment

Patent event date: 20250619

Patent event code: PR07011E01D

PR1002 Payment of registration fee

Payment date: 20250620

End annual number: 3

Start annual number: 1

PG1601 Publication of registration