EA201990935A1 - Способ и устройство для компактного представления данных биоинформатики - Google Patents

Способ и устройство для компактного представления данных биоинформатики

Info

Publication number
EA201990935A1
EA201990935A1 EA201990935A EA201990935A EA201990935A1 EA 201990935 A1 EA201990935 A1 EA 201990935A1 EA 201990935 A EA201990935 A EA 201990935A EA 201990935 A EA201990935 A EA 201990935A EA 201990935 A1 EA201990935 A1 EA 201990935A1
Authority
EA
Eurasian Patent Office
Prior art keywords
compact representation
bioinformatics data
data
encoded
split
Prior art date
Application number
EA201990935A
Other languages
English (en)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed filed Critical
Publication of EA201990935A1 publication Critical patent/EA201990935A1/ru

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/10Ontologies; Annotations
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/20Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/10Sequence alignment; Homology search
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/50Compression of genetic data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/13Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/91Entropy coding, e.g. variable length coding [VLC] or arithmetic coding

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Medical Informatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Biotechnology (AREA)
  • Analytical Chemistry (AREA)
  • Chemical & Material Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Bioethics (AREA)
  • Databases & Information Systems (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
  • Investigating Or Analysing Biological Materials (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

Способ и устройство для сжатия данных геномной последовательности, сгенерированных секвенаторами генома. Последовательности нуклеотидов выравнивают по одной или более референсным последовательностям, классифицируют в соответствии со степенями точности совпадения, кодируют в виде множества слоев синтаксических элементов, используя разные модели источников и энтропийные кодеры для каждого слоя, на которые разбиты данные.
EA201990935A 2016-10-11 2016-10-11 Способ и устройство для компактного представления данных биоинформатики EA201990935A1 (ru)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/EP2016/074307 WO2018068829A1 (en) 2016-10-11 2016-10-11 Method and apparatus for compact representation of bioinformatics data

Publications (1)

Publication Number Publication Date
EA201990935A1 true EA201990935A1 (ru) 2019-11-29

Family

ID=57241050

Family Applications (2)

Application Number Title Priority Date Filing Date
EA201990935A EA201990935A1 (ru) 2016-10-11 2016-10-11 Способ и устройство для компактного представления данных биоинформатики
EA201990922A EA201990922A1 (ru) 2016-10-11 2017-02-14 Способ и система для избирательного доступа к записанным в память или передаваемым биоинформационным данным

Family Applications After (1)

Application Number Title Priority Date Filing Date
EA201990922A EA201990922A1 (ru) 2016-10-11 2017-02-14 Способ и система для избирательного доступа к записанным в память или передаваемым биоинформационным данным

Country Status (22)

Country Link
US (1) US20200051664A1 (ru)
EP (2) EP4235680A3 (ru)
JP (1) JP2020503580A (ru)
KR (1) KR20190071741A (ru)
CN (1) CN110168649A (ru)
AU (1) AU2016426571A1 (ru)
BR (1) BR112019007315A2 (ru)
CA (1) CA3039690A1 (ru)
CL (1) CL2019000957A1 (ru)
CO (1) CO2019003587A2 (ru)
EA (2) EA201990935A1 (ru)
ES (1) ES2947521T3 (ru)
FI (1) FI3526711T3 (ru)
HU (1) HUE062006T2 (ru)
IL (1) IL265906A (ru)
MX (1) MX2019004124A (ru)
PH (1) PH12019500793A1 (ru)
PL (1) PL3526711T3 (ru)
SA (1) SA519401514B1 (ru)
SG (1) SG11201903177PA (ru)
WO (1) WO2018068829A1 (ru)
ZA (1) ZA201902786B (ru)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FI4075438T3 (fi) * 2016-10-11 2024-03-14 Genomsys Sa Tehokkaat datarakenteet bioinformatiikkainformaation esittämistä varten
US20210074381A1 (en) 2019-09-11 2021-03-11 Enancio Method for the compression of genome sequence data
EP3896698A1 (en) 2020-04-15 2021-10-20 Genomsys SA Method and system for the efficient data compression in mpeg-g

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012168815A2 (en) * 2011-06-06 2012-12-13 Koninklijke Philips Electronics N.V. Method for assembly of nucleic acid sequence data
US10902937B2 (en) * 2014-02-12 2021-01-26 International Business Machines Corporation Lossless compression of DNA sequences

Also Published As

Publication number Publication date
EP3526711A1 (en) 2019-08-21
CO2019003587A2 (es) 2019-08-30
EA201990922A1 (ru) 2019-08-30
ZA201902786B (en) 2020-11-25
HUE062006T2 (hu) 2023-09-28
EP4235680A2 (en) 2023-08-30
WO2018068829A1 (en) 2018-04-19
KR20190071741A (ko) 2019-06-24
IL265906A (en) 2019-06-30
FI3526711T3 (fi) 2023-06-27
CN110168649A (zh) 2019-08-23
PH12019500793A1 (en) 2019-12-02
US20200051664A1 (en) 2020-02-13
PL3526711T3 (pl) 2023-08-14
JP2020503580A (ja) 2020-01-30
EP3526711B1 (en) 2023-03-29
SA519401514B1 (ar) 2024-01-04
ES2947521T3 (es) 2023-08-10
BR112019007315A2 (pt) 2019-09-17
AU2016426571A1 (en) 2019-06-06
CL2019000957A1 (es) 2019-08-23
CA3039690A1 (en) 2018-04-19
SG11201903177PA (en) 2019-05-30
MX2019004124A (es) 2019-06-10
EP4235680A3 (en) 2023-10-11

Similar Documents

Publication Publication Date Title
CO2019009920A2 (es) Método y aparato para la representación compacta de datos de bioinformática mediante el uso de múltiples descriptores genómicos
GB2545070A (en) Generating molecular encoding information for data storage
WO2018057959A3 (en) Operation of a library preparation system to perform a protocol on a biological sample
EP4289996A3 (en) Nucleic acid indexing techniques
BR112016029387A2 (pt) sistemas e métodos para cópia intra-bloco
BR112016029871A2 (pt) sistemas e métodos para restrição de parâmetros de formato de representação para um conjunto de parâmetros
EP3754484A3 (en) Generating encoding software and decoding means
MX2016011079A (es) Generalizador de certificacion de conduccion autonoma.
MX2022013015A (es) Sistemas y metodos de uso para utilizar en la identificacion de multiples ediciones genomicas y predecir los efectos acumulados de las ediciones genomicas identificadas.
EA201990986A1 (ru) Способы и системы анализа хроматографических данных
EA201990959A1 (ru) Химерные антигенные рецепторы, нацеленные на антиген созревания b-клеток
WO2014200912A3 (en) Mathematical processes for determination of peptidase cleavage
SE0701690L (sv) Generering av en dataström och identifiering av positioner inuti en dataström
SA517380741B1 (ar) طريقة ومعدة لتحليل جين
EA201990935A1 (ru) Способ и устройство для компактного представления данных биоинформатики
EA201990933A1 (ru) Эффективные структуры данных для представления информации биоинформатики
MX2016010100A (es) Secuenciacion libre de error de acido desoxirribonucleico (adn).
CY1122723T1 (el) Μεθοδος μονοσημαντης και σαφους εξαγωγης κλειδιων απο ενα καναλι διαδοσης
EA201991908A1 (ru) Способ и устройство для компактного представления биоинформационных данных с помощью нескольких геномных дескрипторов
EA201990920A1 (ru) Способ и система для запоминания биоинформационных данных и доступа к ним
NZ711109A (en) Polymerase chain reaction detection system
TR201906026T4 (tr) Kriptografik sistem ve yöntem.
EA201991907A1 (ru) Способ и системы для эффективного сжатия прочтений геномной последовательности
TW201613303A (en) Authentication method for communication
PE20191228A1 (es) Metodo y aparato para representacion compacta de datos bioinformaticos