BR112022007396A2 - Method for selective data access, method and system for data compression - Google Patents
Method for selective data access, method and system for data compression
- Publication number
- BR112022007396A2
- Authority
- BR
- Brazil
- Prior art keywords
- data
- compression
- blocks
- file
- data blocks
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/17—Details of further file system functions
- G06F16/173—Customisation support for file systems, e.g. localisation, multi-language support, personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/17—Details of further file system functions
- G06F16/174—Redundancy elimination performed by the file system
- G06F16/1744—Redundancy elimination performed by the file system using compression, e.g. sparse files
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/123—Storage facilities
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/131—Fragmentation of text files, e.g. creating reusable text-blocks; Linking to fragments, e.g. using XInclude; Namespaces
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
- G06F40/183—Tabulation, i.e. one-dimensional positioning
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
- G16B50/50—Compression of genetic data
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/60—General implementation details not specific to a particular type of compression
- H03M7/6064—Selection of Compressor
- H03M7/607—Selection between different types of compressors
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/70—Type of the data to be coded, other than image and sound
- H03M7/707—Structured documents, e.g. XML
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Databases & Information Systems (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioethics (AREA)
- Medical Informatics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Evolutionary Biology (AREA)
- Biotechnology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biophysics (AREA)
- Genetics & Genomics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Document Processing Apparatus (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
METHOD FOR SELECTIVE DATA ACCESS, METHOD AND SYSTEM FOR DATA COMPRESSION. The present invention relates to a method for compressing data that includes obtaining a compression schema customized for the format of a delimited text file and using the compression schema to parse the delimited text file into a plurality of data blocks, divide each of the data blocks into a plurality of data units for efficient selective access, and compress the plurality of data units in the plurality of data blocks using different compression algorithms for an improved compression ratio. The delimited file is divided into the plurality of data blocks based on the region definitions in the schema. Each of the plurality of data blocks is divided into the plurality of data units based on its respective data unit size specified in the schema. The plurality of data units in each data block is compressed using the different compression algorithms indicated by the compression instructions in the schema. The compressed file consists of compressed data blocks, a compression schema, and various metadata for data decompression, file reconstruction, and functionalities such as data security and search queries. The delimited text file may include genomic information or another type of information.
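The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the schema layout (`prefix`, `unit_size`, `codec`), the function names, and the sample data are all hypothetical, and the sketch assumes that regions appear in the file contiguously and in schema order so that concatenating the blocks restores the original text.

```python
import bz2
import lzma
import zlib

# Hypothetical codec registry: name -> (compress, decompress).
CODECS = {
    "zlib": (zlib.compress, zlib.decompress),
    "bz2": (bz2.compress, bz2.decompress),
    "lzma": (lzma.compress, lzma.decompress),
}

# Hypothetical schema: each region claims lines by prefix and names its own
# data unit size (in lines) and compression algorithm.
SCHEMA = [
    {"name": "header", "prefix": "#", "unit_size": 2, "codec": "zlib"},
    {"name": "records", "prefix": "", "unit_size": 3, "codec": "bz2"},
]

SAMPLE = (
    "#format=demo\n"
    "#chrom\tpos\tref\talt\n"
    "chr1\t100\tA\tG\n"
    "chr1\t250\tC\tT\n"
    "chr2\t75\tG\tA\n"
    "chr2\t310\tT\tC\n"
    "chrX\t42\tA\tT\n"
)


def compress_text(text, schema):
    """Split text into blocks by region, blocks into units, and compress each unit."""
    per_region = {region["name"]: [] for region in schema}
    for line in text.splitlines(keepends=True):
        # The first region whose prefix matches claims the line.
        region = next(r for r in schema if line.startswith(r["prefix"]))
        per_region[region["name"]].append(line)

    blocks = {}
    for region in schema:
        encode = CODECS[region["codec"]][0]
        rows = per_region[region["name"]]
        size = region["unit_size"]
        # Each unit is compressed independently so it can be read on its own.
        blocks[region["name"]] = [
            encode("".join(rows[i:i + size]).encode("utf-8"))
            for i in range(0, len(rows), size)
        ]
    # The schema travels with the compressed blocks, as in the abstract.
    return {"schema": schema, "blocks": blocks}


def read_unit(archive, block_name, unit_index):
    """Selective access: decompress a single data unit and nothing else."""
    region = next(r for r in archive["schema"] if r["name"] == block_name)
    decode = CODECS[region["codec"]][1]
    return decode(archive["blocks"][block_name][unit_index]).decode("utf-8")


def decompress_text(archive):
    """Rebuild the file by concatenating all units in schema order."""
    return "".join(
        read_unit(archive, region["name"], i)
        for region in archive["schema"]
        for i in range(len(archive["blocks"][region["name"]]))
    )
```

Calling `compress_text(SAMPLE, SCHEMA)` and then `read_unit(archive, "records", 1)` decompresses only the second unit of the record block, which is the point of dividing blocks into units: a smaller unit size gives finer-grained access at some cost in compression ratio.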
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201962923113P | 2019-10-18 | 2019-10-18 | |
US202062956941P | 2020-01-03 | 2020-01-03 | |
PCT/EP2020/078996 WO2021074272A1 (en) | 2019-10-18 | 2020-10-15 | Customizable delimited text compression framework |
Publications (1)
Publication Number | Publication Date |
---|---|
BR112022007396A2 (pt) | 2022-07-05 |
Family
ID=72964653
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
BR112022007396A BR112022007396A2 (pt) | 2019-10-18 | 2020-10-15 | Method for selective data access, method and system for data compression |
Country Status (7)
Country | Link |
---|---|
US (1) | US20240095218A1 (pt) |
EP (1) | EP4046052A1 (pt) |
JP (1) | JP2023501093A (pt) |
CN (1) | CN114556318A (pt) |
BR (1) | BR112022007396A2 (pt) |
CA (1) | CA3157786A1 (pt) |
WO (1) | WO2021074272A1 (pt) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116521063B (zh) * | 2023-03-31 | 2024-03-26 | 北京瑞风协同科技股份有限公司 | Method and device for efficient reading and writing of HDF5 test data |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2283591C (en) * | 1997-03-07 | 2006-01-31 | Intelligent Compression Technologies | Data coding network |
KR101922129B1 (ko) * | 2011-12-05 | 2018-11-26 | 삼성전자주식회사 | Method and apparatus for compressing and decompressing genetic information obtained using next-generation sequencing |
2020
- 2020-10-15 BR BR112022007396A patent/BR112022007396A2/pt unknown
- 2020-10-15 CN CN202080073005.0A patent/CN114556318A/zh active Pending
- 2020-10-15 CA CA3157786A patent/CA3157786A1/en active Pending
- 2020-10-15 EP EP20793605.5A patent/EP4046052A1/en active Pending
- 2020-10-15 US US17/768,878 patent/US20240095218A1/en active Pending
- 2020-10-15 WO PCT/EP2020/078996 patent/WO2021074272A1/en active Application Filing
- 2020-10-15 JP JP2022522976A patent/JP2023501093A/ja active Pending
Also Published As
Publication number | Publication date |
---|---|
JP2023501093A (ja) | 2023-01-18 |
CA3157786A1 (en) | 2021-04-22 |
CN114556318A (zh) | 2022-05-27 |
US20240095218A1 (en) | 2024-03-21 |
WO2021074272A1 (en) | 2021-04-22 |
EP4046052A1 (en) | 2022-08-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BR112018016787A2 (pt) | | Storage of virtual reality video in media files |
BR112019005438A2 (pt) | | Method and system for double anonymization of data |
BR112015003406A8 (pt) | | Computer-implemented method and computing system |
BR112018077198A2 (pt) | | Systems and methods for identifying matching content |
CL2016000958A1 (es) | | Methods for processing multi-layer video data to facilitate random access and layer switching, comprising generating a file that comprises a track box containing metadata, wherein media data for the track comprises a sequence of samples; and video devices |
BR112014030110A8 (pt) | | Methods and apparatus for collecting distributed user information for media impressions and search terms |
BR112022007396A2 (pt) | | Method for selective data access, method and system for data compression |
PH12019500791A1 (en) | | Efficient data structures for bioinformatics information presentation |
BR102014027639A8 (pt) | | Method for resolving the entities of a plurality of documents, and entity resolution system for entity resolution of a plurality of documents |
BR112016013587A8 (pt) | | Method, wireless device, and system for web page synthesis on wireless devices |
Hurd | | Finding No Fault with Negligence |
BR112015031171A2 (pt) | | Obtaining an improved therapeutic ligand |
Zouhair et al. | | Contagion versus interdependence: The case of the BRIC Countries during the subprime crises |
Martens | | Sarapis as Healer in Roman Athens: Reconsidering the Identity of Agora S 1068 |
Mocnik et al. | | The effect of tectonic plate motion on OpenStreetMap data |
Katsela | | The city logistics-based business model: a series of components |
Ruan | | A statistical method for rare variants association studies in pedigree data |
Lenneis | | Reconstruction of domestic units based upon distribution analysis and study of the finds density in pit fills |
Jarlert | | Political Reform in Sweden |
Urkin et al. | | Challenges in the provision of health to the rural Bedouin population in southern Israel |
Lupton | | Foreword: Social and Cultural Perspectives on Health, Technology and Medicine |
Celis et al. | | Parliamentary Bodies and the Quality of Women's Substantive Representation: a comparative analysis of UK and Belgian women's parliamentary bodies |
Cahyani et al. | | The Importance of Preserving Tacit Knowledge for Natural Disaster Casualties Anticipation |
Possing | | Representing Gendered Individualities: Reflections on the Biographical Turn |
Rodriguez et al. | | Available Resources for Reconfigurable Systems in 5G Networks |