JP7848681B2 - カスタマイズ可能な区切りテキスト圧縮フレームワーク - Google Patents

カスタマイズ可能な区切りテキスト圧縮フレームワーク

Info

Publication number
JP7848681B2
JP7848681B2 JP2022522976A JP2022522976A JP7848681B2 JP 7848681 B2 JP7848681 B2 JP 7848681B2 JP 2022522976 A JP2022522976 A JP 2022522976A JP 2022522976 A JP2022522976 A JP 2022522976A JP 7848681 B2 JP7848681 B2 JP 7848681B2
Authority
JP
Japan
Prior art keywords
compression
data
schema
compressed
file
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2022522976A
Other languages
English (en)
Japanese (ja)
Other versions
JP2023501093A5 (https=
JP2023501093A (ja
Inventor
イー ヒム チャン
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
Original Assignee
Koninklijke Philips NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips NV filed Critical Koninklijke Philips NV
Publication of JP2023501093A publication Critical patent/JP2023501093A/ja
Publication of JP2023501093A5 publication Critical patent/JP2023501093A5/ja
Application granted granted Critical
Publication of JP7848681B2 publication Critical patent/JP7848681B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/17Details of further file system functions
    • G06F16/173Customisation support for file systems, e.g. localisation, multi-language support, personalisation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/17Details of further file system functions
    • G06F16/174Redundancy elimination performed by the file system
    • G06F16/1744Redundancy elimination performed by the file system using compression, e.g. sparse files
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/123Storage facilities
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/131Fragmentation of text files, e.g. creating reusable text-blocks; Linking to fragments, e.g. using XInclude; Namespaces
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/183Tabulation, i.e. one-dimensional [1D] positioning
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/50Compression of genetic data
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/60General implementation details not specific to a particular type of compression
    • H03M7/6064Selection of Compressor
    • H03M7/607Selection between different types of compressors
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/70Type of the data to be coded, other than image and sound
    • H03M7/707Structured documents, e.g. XML

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioethics (AREA)
  • Biophysics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Genetics & Genomics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Document Processing Apparatus (AREA)
JP2022522976A 2019-10-18 2020-10-15 カスタマイズ可能な区切りテキスト圧縮フレームワーク Active JP7848681B2 (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201962923113P 2019-10-18 2019-10-18
US62/923,113 2019-10-18
US202062956941P 2020-01-03 2020-01-03
US62/956,941 2020-01-03
PCT/EP2020/078996 WO2021074272A1 (en) 2019-10-18 2020-10-15 Customizable delimited text compression framework

Publications (3)

Publication Number Publication Date
JP2023501093A JP2023501093A (ja) 2023-01-18
JP2023501093A5 JP2023501093A5 (https=) 2023-10-23
JP7848681B2 true JP7848681B2 (ja) 2026-04-21

Family

ID=72964653

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2022522976A Active JP7848681B2 (ja) 2019-10-18 2020-10-15 カスタマイズ可能な区切りテキスト圧縮フレームワーク

Country Status (7)

Country Link
US (1) US20240095218A1 (https=)
EP (1) EP4046052A1 (https=)
JP (1) JP7848681B2 (https=)
CN (1) CN114556318A (https=)
BR (1) BR112022007396A2 (https=)
CA (1) CA3157786A1 (https=)
WO (1) WO2021074272A1 (https=)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102948214B1 (ko) * 2021-07-16 2026-04-03 주식회사 쏠리드 프론트홀 다중화 장치
US12387053B2 (en) 2022-01-27 2025-08-12 International Business Machines Corporation Large-scale text data encoding and compression
CN117827775A (zh) * 2022-09-29 2024-04-05 华为技术有限公司 数据压缩方法、装置、计算设备及存储系统
CN116521063B (zh) * 2023-03-31 2024-03-26 北京瑞风协同科技股份有限公司 一种hdf5的试验数据高效读写方法及装置
CN119166428B (zh) * 2024-11-21 2025-10-17 北京高阳捷迅信息技术有限公司 基于大数据的关系型数据库备份恢复方法及系统

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2018502348A (ja) 2014-09-03 2018-01-25 スン−シオン,パトリック 合成ゲノム変異体ベースの安全なトランザクション装置、システム、及び方法
WO2018068827A1 (en) 2016-10-11 2018-04-19 Genomsys Sa Efficient data structures for bioinformatics information representation
WO2018068830A1 (en) 2016-10-11 2018-04-19 Genomsys Sa Method and system for the transmission of bioinformatics data

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2283591C (en) * 1997-03-07 2006-01-31 Intelligent Compression Technologies Data coding network
JP2000252832A (ja) * 1999-02-25 2000-09-14 Nikon Corp データ圧縮装置、およびデータ圧縮プログラムを記録した記録媒体
JP2005018672A (ja) * 2003-06-30 2005-01-20 Hitachi Ltd 構造化文書の圧縮方法
GB2412978A (en) * 2004-04-07 2005-10-12 Hewlett Packard Development Co Method and system for compressing and decompressing hierarchical data structures
US9667269B2 (en) * 2009-04-30 2017-05-30 Oracle International Corporation Technique for compressing XML indexes
JP5280425B2 (ja) * 2010-11-12 2013-09-04 シャープ株式会社 画像処理装置、画像読取装置、画像形成装置、画像処理方法、プログラムおよびその記録媒体
KR101922129B1 (ko) * 2011-12-05 2018-11-26 삼성전자주식회사 차세대 시퀀싱을 이용하여 획득된 유전 정보를 압축 및 압축해제하는 방법 및 장치

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2018502348A (ja) 2014-09-03 2018-01-25 スン−シオン,パトリック 合成ゲノム変異体ベースの安全なトランザクション装置、システム、及び方法
WO2018068827A1 (en) 2016-10-11 2018-04-19 Genomsys Sa Efficient data structures for bioinformatics information representation
WO2018068830A1 (en) 2016-10-11 2018-04-19 Genomsys Sa Method and system for the transmission of bioinformatics data

Also Published As

Publication number Publication date
JP2023501093A (ja) 2023-01-18
BR112022007396A2 (pt) 2022-07-05
EP4046052A1 (en) 2022-08-24
CN114556318A (zh) 2022-05-27
US20240095218A1 (en) 2024-03-21
CA3157786A1 (en) 2021-04-22
WO2021074272A1 (en) 2021-04-22

Similar Documents

Publication Publication Date Title
JP7848681B2 (ja) カスタマイズ可能な区切りテキスト圧縮フレームワーク
US10778441B2 (en) Redactable document signatures
US11916576B2 (en) System and method for effective compression, representation and decompression of diverse tabulated data
Delcher et al. Using MUMmer to identify similar regions in large sequence sets
JP6902104B2 (ja) バイオインフォマティクス情報表示のための効率的データ構造
Holley et al. Bloom filter trie–a data structure for pan-genome storage
US10956659B1 (en) System for generating templates from webpages
CN112889039B (zh) 用于克隆后租户标识符转换的记录的标识
Wilke et al. MG-RAST manual for version 4, revision 3
CN118692573A (zh) 一种基因型数据压缩及检索方法、装置、设备及计算机可读存储介质
CN115002243B (zh) 一种数据处理方法及装置
US12445148B2 (en) System and method for effective compression representation and decompression of diverse tabulated data
JP5500968B2 (ja) 情報処理装置、情報処理方法、及び情報処理プログラム
CN116522116B (zh) Pe文件的分类特征的生成方法及电子设备、存储介质
WO2026093099A1 (en) Processing a genomic data structure with splice information
HK40082649A (en) Efficient data structures for bioinformatics information representation
WO2011099104A1 (ja) ファイル名管理方法及びファイル名管理装置
HK40009794A (en) Efficient data structures for bioinformatics information representation
HK40009794B (en) Efficient data structures for bioinformatics information representation
NZ753247B2 (en) Efficient data structures for bioinformatics information representation

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20231012

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20231012

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20240808

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20240820

RD03 Notification of appointment of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7423

Effective date: 20240822

RD04 Notification of resignation of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7424

Effective date: 20240826

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20241111

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20250306

A601 Written request for extension of time

Free format text: JAPANESE INTERMEDIATE CODE: A601

Effective date: 20250603

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20250826

A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 20251007

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20260126

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20260310

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20260323

R150 Certificate of patent or registration of utility model

Ref document number: 7848681

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150