BR112022007396A2 - Method for selective data access, method and system for data compression

Method for selective data access, method and system for data compression

Info

Publication number
BR112022007396A2
Authority
BR
Brazil
Prior art keywords
data
compression
blocks
file
data blocks
Prior art date
Application number
BR112022007396A
Other languages
English (en)
Inventor
Him Cheung Yee
Original Assignee
Koninklijke Philips Nv
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Nv
Publication of BR112022007396A2

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/17 Details of further file system functions
    • G06F16/173 Customisation support for file systems, e.g. localisation, multi-language support, personalisation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/17 Details of further file system functions
    • G06F16/174 Redundancy elimination performed by the file system
    • G06F16/1744 Redundancy elimination performed by the file system using compression, e.g. sparse files
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G06F40/123 Storage facilities
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G06F40/131 Fragmentation of text files, e.g. creating reusable text-blocks; Linking to fragments, e.g. using XInclude; Namespaces
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/166 Editing, e.g. inserting or deleting
    • G06F40/183 Tabulation, i.e. one-dimensional positioning
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00 ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/50 Compression of genetic data
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03M CODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00 Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30 Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03M CODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00 Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30 Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/60 General implementation details not specific to a particular type of compression
    • H03M7/6064 Selection of Compressor
    • H03M7/607 Selection between different types of compressors
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03M CODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00 Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30 Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/70 Type of the data to be coded, other than image and sound
    • H03M7/707 Structured documents, e.g. XML

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioethics (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Evolutionary Biology (AREA)
  • Biotechnology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biophysics (AREA)
  • Genetics & Genomics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

METHOD FOR SELECTIVE DATA ACCESS, METHOD AND SYSTEM FOR DATA COMPRESSION. The present invention relates to a method for compressing data that includes obtaining a compression schema customized for the format of a delimited text file, and using the compression schema to parse the delimited text file into a plurality of data blocks, divide each of the data blocks into a plurality of data units for efficient selective access, and compress the plurality of data units in the plurality of data blocks using different compression algorithms for an improved compression ratio. The delimited file is divided into the plurality of data blocks based on the region definitions in the schema. Each of the plurality of data blocks is divided into the plurality of data units based on its respective data unit size specified in the schema. The plurality of data units in each data block is compressed using the different compression algorithms indicated by the compression instructions in the schema. The compressed file consists of compressed data blocks, a compression schema, and various metadata for data decompression, file reconstruction, and functionality such as data security and search queries. The delimited text file may include genomic information or another type of information.
BR112022007396A 2019-10-18 2020-10-15 Method for selective data access, method and system for data compression BR112022007396A2 (pt)
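
The pipeline described in the abstract (schema-defined regions, per-region data unit sizes, per-region compression algorithms, and selective access to a single unit) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and does not come from the patent: the schema layout, the field names (`regions`, `unit_rows`, `codec`), the choice of one column per region, the use of the standard-library `zlib`, `bz2`, and `lzma` codecs, and the helper names `compress_file`, `read_value`, and `decompress_file`. The sketch also assumes the regions are listed in column order and cover every column.

```python
import bz2
import lzma
import zlib

# Hypothetical codec registry: name -> (compress, decompress).
CODECS = {
    "zlib": (zlib.compress, zlib.decompress),
    "bz2": (bz2.compress, bz2.decompress),
    "lzma": (lzma.compress, lzma.decompress),
}

# Hypothetical schema: each region is one column of a tab-delimited file,
# with its own data unit size (in rows) and compression instruction.
SCHEMA = {
    "delimiter": "\t",
    "regions": [
        {"name": "chrom", "column": 0, "unit_rows": 2, "codec": "zlib"},
        {"name": "pos", "column": 1, "unit_rows": 2, "codec": "bz2"},
        {"name": "info", "column": 2, "unit_rows": 3, "codec": "lzma"},
    ],
}

# Illustrative input only; not data from the patent.
SAMPLE = (
    "chr1\t100\tDP=10\n"
    "chr1\t200\tDP=12\n"
    "chr2\t50\tDP=7\n"
    "chr2\t75\tDP=9\n"
    "chr3\t10\tDP=3"
)


def compress_file(text, schema):
    """Split text into one block per region, then into separately compressed units."""
    rows = [line.split(schema["delimiter"]) for line in text.splitlines()]
    blocks = {}
    for region in schema["regions"]:
        values = [row[region["column"]] for row in rows]
        compress = CODECS[region["codec"]][0]
        size = region["unit_rows"]
        units = []
        for start in range(0, len(values), size):
            payload = "\n".join(values[start:start + size]).encode("utf-8")
            units.append(compress(payload))
        blocks[region["name"]] = units
    # The archive keeps the schema and metadata next to the compressed blocks.
    return {"schema": schema, "row_count": len(rows), "blocks": blocks}


def read_value(archive, region_name, row):
    """Selective access: decompress only the unit that holds the requested row."""
    region = next(
        r for r in archive["schema"]["regions"] if r["name"] == region_name
    )
    unit_index, offset = divmod(row, region["unit_rows"])
    decompress = CODECS[region["codec"]][1]
    unit = decompress(archive["blocks"][region_name][unit_index])
    return unit.decode("utf-8").split("\n")[offset]


def decompress_file(archive):
    """Rebuild the delimited text by decompressing every unit of every block."""
    schema = archive["schema"]
    columns = []
    for region in schema["regions"]:
        decompress = CODECS[region["codec"]][1]
        values = []
        for unit in archive["blocks"][region["name"]]:
            values.extend(decompress(unit).decode("utf-8").split("\n"))
        columns.append(values)
    return "\n".join(schema["delimiter"].join(f) for f in zip(*columns))
```

With this layout, `read_value(compress_file(SAMPLE, SCHEMA), "pos", 3)` touches a single compressed unit of the `pos` block rather than the whole file, which is the point of splitting blocks into units. A smaller `unit_rows` makes random access cheaper but typically lowers the compression ratio, so the per-region unit size is a trade-off the schema author would tune.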

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201962923113P 2019-10-18 2019-10-18
US202062956941P 2020-01-03 2020-01-03
PCT/EP2020/078996 WO2021074272A1 (en) 2019-10-18 2020-10-15 Customizable delimited text compression framework

Publications (1)

Publication Number Publication Date
BR112022007396A2 (pt) 2022-07-05

Family

ID=72964653

Family Applications (1)

Application Number Title Priority Date Filing Date
BR112022007396A BR112022007396A2 (pt) 2019-10-18 2020-10-15 Method for selective data access, method and system for data compression

Country Status (7)

Country Link
US (1) US20240095218A1 (pt)
EP (1) EP4046052A1 (pt)
JP (1) JP2023501093A (pt)
CN (1) CN114556318A (pt)
BR (1) BR112022007396A2 (pt)
CA (1) CA3157786A1 (pt)
WO (1) WO2021074272A1 (pt)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116521063B (zh) * 2023-03-31 2024-03-26 北京瑞风协同科技股份有限公司 Method and device for efficient reading and writing of HDF5 test data

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2283591C (en) * 1997-03-07 2006-01-31 Intelligent Compression Technologies Data coding network
KR101922129B1 (ko) * 2011-12-05 2018-11-26 삼성전자주식회사 Method and apparatus for compressing and decompressing genetic information obtained using next-generation sequencing

Also Published As

Publication number Publication date
JP2023501093A (ja) 2023-01-18
CA3157786A1 (en) 2021-04-22
CN114556318A (zh) 2022-05-27
US20240095218A1 (en) 2024-03-21
WO2021074272A1 (en) 2021-04-22
EP4046052A1 (en) 2022-08-24

Similar Documents

Publication Publication Date Title
BR112018016787A2 (pt) Storing virtual reality video in media files
BR112019005438A2 (pt) Method and system for double anonymization of data
BR112015003406A8 (pt) Computer-implemented method and computing system
BR112018077198A2 (pt) Systems and methods for identifying matching content
CL2016000958A1 (es) Methods for processing multi-layer video data to facilitate random access and layer switching, comprising generating a file that comprises a track box containing metadata, wherein media data for the track comprises a sequence of samples; and video devices.
BR112014030110A8 (pt) Methods and apparatus for collecting distributed user information for media impressions and search terms
BR112022007396A2 (pt) Method for selective data access, method and system for data compression
PH12019500791A1 (en) Efficient data structures for bioinformatics information presentation
BR102014027639A8 (pt) Method for resolving the entities of a plurality of documents, and entity resolution system for resolving the entities of a plurality of documents
BR112016013587A8 (pt) Method, wireless device and system for web page rendering on wireless devices
Hurd Finding No Fault with Negligence
BR112015031171A2 (pt) Obtaining an improved therapeutic ligand
Zouhair et al. Contagion versus interdependence: The case of the BRIC Countries during the subprime crises
Martens Sarapis as Healer in Roman Athens: Reconsidering the Identity of Agora S 1068
Mocnik et al. The effect of tectonic plate motion on OpenStreetMap data
Katsela The city logistics-based business model: a series of components
Ruan A statistical method for rare variants association studies in pedigree data
Lenneis Reconstruction of domestic units based upon distribution analysis and study of the finds density in pit fills
Jarlert Political Reform in Sweden
Urkin et al. Challenges in the provision of health to the rural Bedouin population in southern Israel
Lupton Foreword: Social and Cultural Perspectives on Health, Technology and Medicine
Celis et al. Parliamentary Bodies and the Quality of Women’s Substantive Representation: a comparative analysis of UK and Belgian women’s parliamentary bodies.
Cahyani et al. The Importance of Preserving Tacit Knowledge for Natural Disaster Casualties Anticipation
Possing Representing Gendered Individualities: Reflections on the Biographical Turn
Rodriguez et al. Available Resources for Reconfigurable Systems in 5G Networks