JP7631308B2 - エンコーディング及びデコーディングテーブルを用いたセミソーティング圧縮 - Google Patents
エンコーディング及びデコーディングテーブルを用いたセミソーティング圧縮 Download PDFInfo
- Publication number
- JP7631308B2 JP7631308B2 JP2022508996A JP2022508996A JP7631308B2 JP 7631308 B2 JP7631308 B2 JP 7631308B2 JP 2022508996 A JP2022508996 A JP 2022508996A JP 2022508996 A JP2022508996 A JP 2022508996A JP 7631308 B2 JP7631308 B2 JP 7631308B2
- Authority
- JP
- Japan
- Prior art keywords
- prefix
- data
- encoding
- prefixes
- value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0628—Interfaces specially adapted for storage systems making use of a particular technique
- G06F3/0638—Organizing or formatting or addressing of data
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/74—Selecting or encoding within a word the position of one or more bits having a specified value, e.g. most or least significant one or zero detection, priority encoders
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0602—Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
- G06F3/0608—Saving storage space on storage systems
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0628—Interfaces specially adapted for storage systems making use of a particular technique
- G06F3/0655—Vertical data movement, i.e. input-output transfer; data movement between one or more hosts and one or more storage devices
- G06F3/0661—Format or protocol conversion arrangements
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0668—Interfaces specially adapted for storage systems adopting a particular infrastructure
- G06F3/0671—In-line storage system
- G06F3/0673—Single storage device
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0668—Interfaces specially adapted for storage systems adopting a particular infrastructure
- G06F3/0671—In-line storage system
- G06F3/0673—Single storage device
- G06F3/0679—Non-volatile semiconductor memory device, e.g. flash memory, one time programmable memory [OTP]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/06—Arrangements for sorting, selecting, merging, or comparing data on individual record carriers
- G06F7/08—Sorting, i.e. grouping record carriers in numerical or other ordered sequence according to the classification of at least some of the information they carry
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/40—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
- H03M7/4031—Fixed length to variable length coding
- H03M7/4037—Prefix coding
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/40—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
- H03M7/4031—Fixed length to variable length coding
- H03M7/4037—Prefix coding
- H03M7/4043—Adaptive prefix coding
- H03M7/4056—Coding table selection
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/40—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
- H03M7/4031—Fixed length to variable length coding
- H03M7/4037—Prefix coding
- H03M7/4043—Adaptive prefix coding
- H03M7/4062—Coding table adaptation
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/542,872 US11309911B2 (en) | 2019-08-16 | 2019-08-16 | Semi-sorting compression with encoding and decoding tables |
| US16/542,872 | 2019-08-16 | ||
| PCT/US2020/045903 WO2021034565A1 (en) | 2019-08-16 | 2020-08-12 | Semi-sorting compression with encoding and decoding tables |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2022545644A JP2022545644A (ja) | 2022-10-28 |
| JP2022545644A5 JP2022545644A5 (enExample) | 2023-08-17 |
| JP7631308B2 true JP7631308B2 (ja) | 2025-02-18 |
Family
ID=74567451
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2022508996A Active JP7631308B2 (ja) | 2019-08-16 | 2020-08-12 | エンコーディング及びデコーディングテーブルを用いたセミソーティング圧縮 |
Country Status (6)
| Country | Link |
|---|---|
| US (2) | US11309911B2 (enExample) |
| EP (1) | EP4014128A4 (enExample) |
| JP (1) | JP7631308B2 (enExample) |
| KR (1) | KR102824624B1 (enExample) |
| CN (1) | CN114258521B (enExample) |
| WO (1) | WO2021034565A1 (enExample) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11309911B2 (en) * | 2019-08-16 | 2022-04-19 | Advanced Micro Devices, Inc. | Semi-sorting compression with encoding and decoding tables |
| US20240185021A1 (en) * | 2022-12-01 | 2024-06-06 | Western Digital Technologies, Inc. | Pre-encoding method for dna storage |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20110158323A1 (en) | 2008-09-12 | 2011-06-30 | Qu Qing Chen | Method for lossless compressing prefix-suffix-codes, method for decompressing a bit sequence representing integers or symbols encoded in compressed prefix-suffix-codes and storage medium or signal carrying compressed prefix-suffix-codes |
Family Cites Families (21)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5442630A (en) * | 1991-05-24 | 1995-08-15 | Gagliardi; Ugo O. | ISDN interfacing of local area networks |
| US5704060A (en) | 1995-05-22 | 1997-12-30 | Del Monte; Michael G. | Text storage and retrieval system and method |
| US6489902B2 (en) | 1997-12-02 | 2002-12-03 | Hughes Electronics Corporation | Data compression for use with a communications channel |
| US6121903A (en) | 1998-01-27 | 2000-09-19 | Infit Communications Ltd. | On-the-fly data re-compression |
| US6650259B1 (en) | 2002-05-06 | 2003-11-18 | Unisys Corporation | Character table implemented data decompression method and apparatus |
| US8238923B2 (en) | 2004-12-22 | 2012-08-07 | Qualcomm Incorporated | Method of using shared resources in a communication system |
| WO2007108395A1 (ja) * | 2006-03-23 | 2007-09-27 | Nec Corporation | 可変長符号の復号装置および復号方法 |
| US8458457B2 (en) * | 2007-02-02 | 2013-06-04 | Red Hat, Inc. | Method and system for certificate revocation list pre-compression encoding |
| US20080317364A1 (en) * | 2007-06-25 | 2008-12-25 | Augusta Technology, Inc. | Methods for determining neighboring locations for partitions of a video stream |
| US7609179B2 (en) | 2008-01-08 | 2009-10-27 | International Business Machines Corporation | Method for compressed data with reduced dictionary sizes by coding value prefixes |
| TW201141081A (en) * | 2009-12-29 | 2011-11-16 | Ibm | Prefix-offset encoding method for data compression |
| US20120110025A1 (en) | 2010-10-28 | 2012-05-03 | Qualcomm Incorporated | Coding order-independent collections of words |
| US8606772B1 (en) | 2011-01-26 | 2013-12-10 | Trend Micro Incorporated | Efficient multiple-keyword match technique with large dictionaries |
| KR101672107B1 (ko) | 2011-11-08 | 2016-11-02 | 구글 테크놀로지 홀딩스 엘엘씨 | 변환 계수들에 대한 이진 코드워드들을 결정하는 방법 |
| US10235377B2 (en) | 2013-12-23 | 2019-03-19 | Sap Se | Adaptive dictionary compression/decompression for column-store databases |
| GB2531005A (en) * | 2014-10-06 | 2016-04-13 | Canon Kk | Improved encoding process using a palette mode |
| US9647684B2 (en) | 2014-10-21 | 2017-05-09 | Huawei Technologies Co., Ltd. | Memory-based history search |
| US10191682B2 (en) * | 2016-09-08 | 2019-01-29 | Qualcomm Incorporated | Providing efficient lossless compression for small data blocks in processor-based systems |
| US10613791B2 (en) | 2017-06-12 | 2020-04-07 | Pure Storage, Inc. | Portable snapshot replication between storage systems |
| US10838961B2 (en) * | 2017-09-29 | 2020-11-17 | Oracle International Corporation | Prefix compression |
| US11309911B2 (en) * | 2019-08-16 | 2022-04-19 | Advanced Micro Devices, Inc. | Semi-sorting compression with encoding and decoding tables |
-
2019
- 2019-08-16 US US16/542,872 patent/US11309911B2/en active Active
-
2020
- 2020-08-12 CN CN202080058010.4A patent/CN114258521B/zh active Active
- 2020-08-12 KR KR1020227007999A patent/KR102824624B1/ko active Active
- 2020-08-12 EP EP20854389.2A patent/EP4014128A4/en active Pending
- 2020-08-12 JP JP2022508996A patent/JP7631308B2/ja active Active
- 2020-08-12 WO PCT/US2020/045903 patent/WO2021034565A1/en not_active Ceased
-
2022
- 2022-04-18 US US17/722,931 patent/US11736119B2/en active Active
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20110158323A1 (en) | 2008-09-12 | 2011-06-30 | Qu Qing Chen | Method for lossless compressing prefix-suffix-codes, method for decompressing a bit sequence representing integers or symbols encoded in compressed prefix-suffix-codes and storage medium or signal carrying compressed prefix-suffix-codes |
| JP2012502573A (ja) | 2008-09-12 | 2012-01-26 | トムソン ライセンシング | プリフィックス‐サフィックス符号をロスレス圧縮する方法、圧縮されたプリフィックス‐サフィックス符号内に符号化された整数またはシンボルを表すビットシーケンスを圧縮解除する方法、および、圧縮されたプリフィックス‐サフィックス符号を担持する記憶媒体または信号 |
Also Published As
| Publication number | Publication date |
|---|---|
| US11736119B2 (en) | 2023-08-22 |
| CN114258521B (zh) | 2025-09-02 |
| WO2021034565A1 (en) | 2021-02-25 |
| US11309911B2 (en) | 2022-04-19 |
| EP4014128A1 (en) | 2022-06-22 |
| KR20220049540A (ko) | 2022-04-21 |
| CN114258521A (zh) | 2022-03-29 |
| JP2022545644A (ja) | 2022-10-28 |
| EP4014128A4 (en) | 2023-08-09 |
| KR102824624B1 (ko) | 2025-06-24 |
| US20220239315A1 (en) | 2022-07-28 |
| US20210050864A1 (en) | 2021-02-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11669521B2 (en) | Accelerated filtering, grouping and aggregation in a database system | |
| US11762828B2 (en) | Cuckoo filters and cuckoo hash tables with biasing, compression, and decoupled logical sparsity | |
| US20130141259A1 (en) | Method and system for data compression | |
| JP2014534486A (ja) | スケーラブル・データ・デュプリケーションのための方法、システム、およびコンピュータ・プログラム | |
| JP7758686B2 (ja) | 基本データシーブを使用して無損失削減されたデータの効率的な取出しのための基本データの局所性の利用 | |
| RU2633178C2 (ru) | Способ и система базы данных для индексирования ссылок на документы базы данных | |
| Belazzougui et al. | Cache-oblivious peeling of random hypergraphs | |
| JP7631308B2 (ja) | エンコーディング及びデコーディングテーブルを用いたセミソーティング圧縮 | |
| CN110059129A (zh) | 数据存储方法、装置及电子设备 | |
| CN107852173B (zh) | 对无损简化的数据执行搜索和取回的方法以及装置 | |
| US11139828B2 (en) | Memory compression method and apparatus | |
| JP2009512099A (ja) | トライでの再始動可能なハッシュの方法及び装置 | |
| JP2023503034A (ja) | パターンベースのキャッシュブロック圧縮 | |
| Weißenberger et al. | Massively parallel inverse block-sorting transforms for bzip2 decompression on GPUs | |
| US10749545B1 (en) | Compressing tags in software and hardware semi-sorted caches | |
| Bertola et al. | A Class of Heuristics for Reducing the Number of BWT-Runs in the String Ordering Problem | |
| Xiao et al. | Highly efficient string similarity search and join over compressed indexes | |
| Díaz-Domínguez et al. | Algorithms for Computing Very Large BWTs: a Short Survey | |
| Waidyasooriya et al. | An fpga architecture for text search using a wavelet-tree-based succinct-data-structure | |
| Liu et al. | An Efficient Direct Access Algorithm for Integer Compression | |
| JP5419069B2 (ja) | データベース装置、データベースの管理方法、データベースのデータ構造、データベースの管理プログラムおよびそれを記録したコンピュータ読み取り可能な記録媒体 | |
| Pavlo | Lecture# 03: Data Formats & Encoding II | |
| Priyadarshini et al. | A modified and memory saving approach to B+ tree index for search of an image database based on chain codes | |
| Zhang | Efficient Parallel Text Compression on GPUs |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20220419 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20230808 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20230808 |
|
| A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20240626 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20240709 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20241001 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20250114 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20250205 |
|
| R150 | Certificate of patent or registration of utility model |
Ref document number: 7631308 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |