JP7848681B2 - Customizable delimited text compression framework - Google Patents
Customizable delimited text compression framework
Info
- Publication number
- JP7848681B2 (application JP2022522976A)
- Authority
- JP
- Japan
- Prior art keywords
- compression
- data
- schema
- compressed
- file
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/17—Details of further file system functions
- G06F16/173—Customisation support for file systems, e.g. localisation, multi-language support, personalisation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/17—Details of further file system functions
- G06F16/174—Redundancy elimination performed by the file system
- G06F16/1744—Redundancy elimination performed by the file system using compression, e.g. sparse files
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/123—Storage facilities
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/131—Fragmentation of text files, e.g. creating reusable text-blocks; Linking to fragments, e.g. using XInclude; Namespaces
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
- G06F40/183—Tabulation, i.e. one-dimensional [1D] positioning
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
- G16B50/50—Compression of genetic data
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/60—General implementation details not specific to a particular type of compression
- H03M7/6064—Selection of Compressor
- H03M7/607—Selection between different types of compressors
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/70—Type of the data to be coded, other than image and sound
- H03M7/707—Structured documents, e.g. XML
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Databases & Information Systems (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioethics (AREA)
- Biophysics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biotechnology (AREA)
- Evolutionary Biology (AREA)
- Medical Informatics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Genetics & Genomics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Document Processing Apparatus (AREA)
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201962923113P | 2019-10-18 | 2019-10-18 | |
| US62/923,113 | 2019-10-18 | | |
| US202062956941P | 2020-01-03 | 2020-01-03 | |
| US62/956,941 | 2020-01-03 | | |
| PCT/EP2020/078996 WO2021074272A1 (en) | 2019-10-18 | 2020-10-15 | Customizable delimited text compression framework |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2023501093A (ja) | 2023-01-18 |
| JP2023501093A5 | 2023-10-23 |
| JP7848681B2 (ja) | 2026-04-21 |
Family
ID=72964653
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2022522976A (Active; granted as JP7848681B2, ja) | 2019-10-18 | 2020-10-15 | Customizable delimited text compression framework |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US20240095218A1 |
| EP (1) | EP4046052A1 |
| JP (1) | JP7848681B2 |
| CN (1) | CN114556318A |
| BR (1) | BR112022007396A2 |
| CA (1) | CA3157786A1 |
| WO (1) | WO2021074272A1 |
Families Citing this family (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR102948214B1 (ko) * | 2021-07-16 | 2026-04-03 | 주식회사 쏠리드 | Fronthaul multiplexing device |
| US12387053B2 (en) | 2022-01-27 | 2025-08-12 | International Business Machines Corporation | Large-scale text data encoding and compression |
| CN117827775A (zh) * | 2022-09-29 | 2024-04-05 | 华为技术有限公司 | Data compression method and apparatus, computing device, and storage system |
| CN116521063B (zh) * | 2023-03-31 | 2024-03-26 | 北京瑞风协同科技股份有限公司 | Method and apparatus for efficient reading and writing of HDF5 test data |
| CN119166428B (zh) * | 2024-11-21 | 2025-10-17 | 北京高阳捷迅信息技术有限公司 | Big-data-based relational database backup and recovery method and system |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2018502348A (ja) | 2014-09-03 | 2018-01-25 | スン−シオン,パトリック | Synthetic genomic variant-based secure transaction devices, systems, and methods |
| WO2018068827A1 (en) | 2016-10-11 | 2018-04-19 | Genomsys Sa | Efficient data structures for bioinformatics information representation |
| WO2018068830A1 (en) | 2016-10-11 | 2018-04-19 | Genomsys Sa | Method and system for the transmission of bioinformatics data |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CA2283591C (en) * | 1997-03-07 | 2006-01-31 | Intelligent Compression Technologies | Data coding network |
| JP2000252832A (ja) * | 1999-02-25 | 2000-09-14 | Nikon Corp | Data compression device and recording medium storing a data compression program |
| JP2005018672A (ja) * | 2003-06-30 | 2005-01-20 | Hitachi Ltd | Method for compressing structured documents |
| GB2412978A (en) * | 2004-04-07 | 2005-10-12 | Hewlett Packard Development Co | Method and system for compressing and decompressing hierarchical data structures |
| US9667269B2 (en) * | 2009-04-30 | 2017-05-30 | Oracle International Corporation | Technique for compressing XML indexes |
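| JP5280425B2 (ja) * | 2010-11-12 | 2013-09-04 | シャープ株式会社 | Image processing apparatus, image reading apparatus, image forming apparatus, image processing method, program, and recording medium therefor |
| KR101922129B1 (ko) * | 2011-12-05 | 2018-11-26 | 삼성전자주식회사 | Method and apparatus for compressing and decompressing genetic information obtained using next-generation sequencing |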
-
2020
- 2020-10-15 JP JP2022522976A patent/JP7848681B2/ja active Active
- 2020-10-15 CA CA3157786A patent/CA3157786A1/en active Pending
- 2020-10-15 BR BR112022007396A patent/BR112022007396A2/pt unknown
- 2020-10-15 CN CN202080073005.0A patent/CN114556318A/zh active Pending
- 2020-10-15 US US17/768,878 patent/US20240095218A1/en active Pending
- 2020-10-15 EP EP20793605.5A patent/EP4046052A1/en active Pending
- 2020-10-15 WO PCT/EP2020/078996 patent/WO2021074272A1/en not_active Ceased
Also Published As
| Publication number | Publication date |
|---|---|
| JP2023501093A (ja) | 2023-01-18 |
| BR112022007396A2 (pt) | 2022-07-05 |
| EP4046052A1 (en) | 2022-08-24 |
| CN114556318A (zh) | 2022-05-27 |
| US20240095218A1 (en) | 2024-03-21 |
| CA3157786A1 (en) | 2021-04-22 |
| WO2021074272A1 (en) | 2021-04-22 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP7848681B2 (ja) | | Customizable delimited text compression framework |
| US10778441B2 (en) | | Redactable document signatures |
| US11916576B2 (en) | | System and method for effective compression, representation and decompression of diverse tabulated data |
| Delcher et al. | | Using MUMmer to identify similar regions in large sequence sets |
| JP6902104B2 (ja) | | Efficient data structures for bioinformatics information representation |
| Holley et al. | | Bloom filter trie–a data structure for pan-genome storage |
| US10956659B1 (en) | | System for generating templates from webpages |
| CN112889039B (zh) | | Identification of records for post-cloning tenant identifier translation |
| Wilke et al. | | MG-RAST manual for version 4, revision 3 |
| CN118692573A (zh) | | Genotype data compression and retrieval method, apparatus, device, and computer-readable storage medium |
| CN115002243B (zh) | | Data processing method and apparatus |
| US12445148B2 (en) | | System and method for effective compression representation and decompression of diverse tabulated data |
| JP5500968B2 (ja) | | Information processing apparatus, information processing method, and information processing program |
| CN116522116B (zh) | | Method for generating classification features of PE files, electronic device, and storage medium |
| WO2026093099A1 (en) | | Processing a genomic data structure with splice information |
| HK40082649A (en) | | Efficient data structures for bioinformatics information representation |
| WO2011099104A1 (ja) | | File name management method and file name management device |
| HK40009794A (en) | | Efficient data structures for bioinformatics information representation |
| HK40009794B (en) | | Efficient data structures for bioinformatics information representation |
| NZ753247B2 (en) | | Efficient data structures for bioinformatics information representation |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 2023-10-12 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
| 2023-10-12 | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621 |
| 2024-08-08 | A977 | Report on retrieval | Free format text: JAPANESE INTERMEDIATE CODE: A971007 |
| 2024-08-20 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 |
| 2024-08-22 | RD03 | Notification of appointment of power of attorney | Free format text: JAPANESE INTERMEDIATE CODE: A7423 |
| 2024-08-26 | RD04 | Notification of resignation of power of attorney | Free format text: JAPANESE INTERMEDIATE CODE: A7424 |
| 2024-11-11 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
| 2025-03-06 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 |
| 2025-06-03 | A601 | Written request for extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A601 |
| 2025-08-26 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
| 2025-10-07 | A02 | Decision of refusal | Free format text: JAPANESE INTERMEDIATE CODE: A02 |
| 2026-01-26 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
| | TRDD | Decision of grant or rejection written | |
| 2026-03-10 | A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01 |
| 2026-03-23 | A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61 |
| | R150 | Certificate of patent or registration of utility model | Ref document number: 7848681; Country of ref document: JP; Free format text: JAPANESE INTERMEDIATE CODE: R150 |