KR20080102006A - Gene sequence alignment method using space partitioning - Google Patents
- Publication number
- KR20080102006A (application number KR1020070048253A)
- Authority
- KR
- South Korea
- Prior art keywords
- block
- algorithm
- matrix
- optimal alignment
- alg
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B15/00—ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
Landscapes
- Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Medical Informatics (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Theoretical Computer Science (AREA)
- Evolutionary Biology (AREA)
- Biotechnology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Public Health (AREA)
- Software Systems (AREA)
- Bioethics (AREA)
- Evolutionary Computation (AREA)
- Epidemiology (AREA)
- Databases & Information Systems (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Chemical & Material Sciences (AREA)
- Crystallography & Structural Chemistry (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Claims (5)
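- A gene sequence alignment method using space partitioning, comprising: generating partial matrices and block matrices by dividing the DPA matrix by arbitrary integers for its rows and columns; obtaining a partial optimal alignment within each block by running the DPA on each generated block matrix; and obtaining the overall optimal alignment by applying a traceback algorithm to the partial optimal alignments obtained within the blocks.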
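- The gene sequence alignment method using space partitioning according to claim 1, further comprising, when dividing the DPA matrix by arbitrary integers for its rows and columns, applying the following [Theorem] if the DPA matrix is not evenly divisible by the chosen integer.
  [Theorem] Let m be the length of sequence S, let the integer d be the number of partitions, and let βR be the size of the block table. If m/d ≤ m%d, the block size is βR = (m/d + 1); otherwise, βR = m/d. (Here / denotes the quotient and % the remainder.)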
- The gene sequence alignment method using space partitioning according to claim 1 or 2, wherein the partition table (D') and the block table (B') corresponding to the partial matrices and the block matrices are obtained through the following [Algorithm].
  [Algorithm]
  (1) Calculate: βR, βC.
  (2) Create table: R_table, C_table, B_table, H_table
  (3) Initialization: H_Table(1,j) ← j*σ(-, i) [j=0,...,n];
  (4) for i ← 1 to m do
      begin
  (5)   H_Table(0,j) ← H_Table(1,j) [j=0,...,n];
  (6)   for j ← 1 to n do
          Diagonal ← H_Table(0,j-1) + σ(S[i],T[j]),
          Vertical ← H_Table(0,j) + σ(S[i],-),
          Horizontal ← H_Table(1,j-1) + σ(-,T[j])
          A(1,j) ← max(Diagonal, Vertical, Horizontal)
  (7)   [ALG A], [ALG B]
      end
  (8) Give output: D', B'
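- The gene sequence alignment method using space partitioning according to claim 3, wherein [ALG A] and [ALG B] used in the [Algorithm] are, respectively, an entry selection algorithm for building the row table ([ALG A]) and an entry selection algorithm for building the column table ([ALG B]), configured as follows.
  [ALG A]
  begin
    R_Table[i][col] ← H_Table[1][j], [j=j+βC, 0≤j≤n]
    col++;
  end
  [ALG B]
  begin
    if i % βR = 0
      row++;
      C_Table[row][j] ← H_Table[1][j], [j=0,...,n];
  end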
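- The gene sequence alignment method using space partitioning according to claim 4, wherein the step of obtaining the partial optimal alignment within each block is performed by computing entry values in word units.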
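The block-size rule of claim 2 and the table-building loop of claims 3 and 4 can be read as the following minimal Python sketch. It is one possible interpretation, not the applicant's implementation. The names `block_size`, `build_tables`, and `sigma`, the scoring values (+1 match, -1 mismatch, -2 gap), the per-row update of column 0, and the choice to record row 0 in both tables are assumptions that the claims do not state. The sketch also assumes that DPA refers to a global dynamic-programming alignment, and it covers only the table construction, not the per-block alignment or the traceback of claim 1.

```python
from typing import Callable, List, Tuple

GAP = "-"


def block_size(m: int, d: int) -> int:
    """Block size per the [Theorem] of claim 2 (/ = quotient, % = remainder)."""
    q, r = divmod(m, d)
    return q + 1 if q <= r else q


def sigma(a: str, b: str) -> int:
    """Hypothetical scoring function: +1 match, -1 mismatch, -2 gap."""
    if a == GAP or b == GAP:
        return -2
    return 1 if a == b else -1


def build_tables(
    S: str,
    T: str,
    d_rows: int,
    d_cols: int,
    score: Callable[[str, str], int] = sigma,
) -> Tuple[int, int, List[List[int]], List[List[int]], int]:
    """Sweep the DP matrix with two rolling rows, keeping only block borders.

    R_table keeps every beta_c-th column of each row (ALG A).
    C_table keeps every beta_r-th row in full (ALG B).
    """
    m, n = len(S), len(T)
    beta_r = block_size(m, d_rows)  # step (1)
    beta_c = block_size(n, d_cols)

    # H[0] is the previous row, H[1] the current row (the claim's H_Table).
    H = [[0] * (n + 1), [0] * (n + 1)]
    for j in range(1, n + 1):  # step (3): row 0 holds only gap costs
        H[1][j] = H[1][j - 1] + score(GAP, T[j - 1])

    # Assumption: row 0 is recorded so the first blocks have a border.
    R_table = [[H[1][j] for j in range(0, n + 1, beta_c)]]
    C_table = [list(H[1])]

    for i in range(1, m + 1):  # step (4)
        H[0] = H[1][:]  # step (5): current row becomes previous row
        # Assumption: column 0 is updated per row (not spelled out in claim 3).
        H[1][0] = H[0][0] + score(S[i - 1], GAP)
        for j in range(1, n + 1):  # step (6)
            diagonal = H[0][j - 1] + score(S[i - 1], T[j - 1])
            vertical = H[0][j] + score(S[i - 1], GAP)
            horizontal = H[1][j - 1] + score(GAP, T[j - 1])
            H[1][j] = max(diagonal, vertical, horizontal)
        # ALG A: sample the current row at every beta_c-th column.
        R_table.append([H[1][j] for j in range(0, n + 1, beta_c)])
        # ALG B: keep the whole row when i is a multiple of beta_r.
        if i % beta_r == 0:
            C_table.append(list(H[1]))

    return beta_r, beta_c, R_table, C_table, H[1][n]
```

For example, `build_tables("ACGTACGT", "ACGTACGT", 3, 2)` yields block sizes βR = 3 and βC = 4. Only two full rows live in memory during the sweep, and the stored border rows and columns are what a later per-block recomputation and traceback would start from.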
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020070048253A KR20080102006A (ko) | 2007-05-17 | 2007-05-17 | Gene sequence alignment method using space partitioning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020070048253A KR20080102006A (ko) | 2007-05-17 | 2007-05-17 | Gene sequence alignment method using space partitioning |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20080102006A (ko) | 2008-11-24 |
Family
ID=40287986
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020070048253A (Ceased) KR20080102006A (ko) | 2007-05-17 | 2007-05-17 | Gene sequence alignment method using space partitioning |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR20080102006A (ko) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
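KR101322123B1 (ko) * | 2013-06-14 | 2013-10-28 | Inha University Industry-Academic Cooperation Foundation | Method for computing, in parallel, an extended edit distance that includes swap operations |
KR20180130755A (ko) | 2017-05-30 | 2018-12-10 | Dankook University Industry-Academic Cooperation Foundation | Method for updating a contig profile and method for forming contigs for DNA shotgun sequencing or RNA transcriptome assembly |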
- 2007-05-17: KR application KR1020070048253A, published as KR20080102006A (ko), not active (Ceased)
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Gupta et al. | RAPID: A ReRAM processing in-memory architecture for DNA sequence alignment | |
Sato et al. | RNA secondary structural alignment with conditional random fields | |
Matsui et al. | Pair stochastic tree adjoining grammars for aligning and predicting pseudoknot RNA structures | |
CN112735528A (zh) | Gene sequence alignment method and system | |
US8965935B2 (en) | Sequence matching algorithm | |
Cheetham et al. | Parallel clustal w for pc clusters | |
Knudsen | Optimal multiple parsimony alignment with affine gap cost using a phylogenetic tree | |
Garrison | Graphical pangenomics | |
KR20080102006A (ko) | Gene sequence alignment method using space partitioning | |
Kaghed et al. | Multiple sequence alignment based on developed genetic algorithm | |
Ecker et al. | A machine-learning-based alternative to phylogenetic bootstrap | |
Christodoulakis et al. | Computation of repetitions and regularities of biologically weighted sequences | |
Wheeler et al. | Optimizing reduced-space sequence analysis | |
Nath et al. | A survey on longest common subsequence | |
Myoupo et al. | Time-efficient parallel algorithms for the longest common subsequence and related problems | |
Yanovsky et al. | Read mapping algorithms for single molecule sequencing data | |
Moyer et al. | Motif identification using CNN-based pairwise subsequence alignment score prediction | |
WO2007025800A1 (en) | Global alignment of sequence data | |
Marcolin et al. | Efficient k-mer Indexing with Application to Mapping-free SNP Genotyping. | |
Zhang et al. | Parallel divide and conquer bio-sequence comparison based on Smith-Waterman algorithm | |
Bannai et al. | Finding optimal pairs of patterns | |
US20090325820A1 (en) | Hardware acceleration for thermodynamically constrained DNA code generation | |
Frid et al. | A Simple, Practical and Complete O(n^3/log n)-Time Algorithm for RNA Folding Using the Four-Russians Speedup | |
Blassel | From sequences to knowledge, improving and learning from sequence alignments | |
Menolascina et al. | A multi-objective genetic algorithm based approach to the optimization of oligonucleotide microarray production process |
Legal Events
Code | Title | Description |
---|---|---|
A201 | Request for examination | |
PA0109 | Patent application | Patent event code: PA01091R01D; Comment text: Patent Application; Patent event date: 20070517 |
PA0201 | Request for examination | |
E902 | Notification of reason for refusal | |
PE0902 | Notice of grounds for rejection | Comment text: Notification of reason for refusal; Patent event date: 20081028; Patent event code: PE09021S01D |
PG1501 | Laying open of application | |
E601 | Decision to refuse application | |
PE0601 | Decision on rejection of patent | Patent event date: 20090507; Comment text: Decision to Refuse Application; Patent event code: PE06012S01D. Patent event date: 20081028; Comment text: Notification of reason for refusal; Patent event code: PE06011S01I |
J201 | Request for trial against refusal decision | |
PJ0201 | Trial against decision of rejection | Patent event date: 20090707; Comment text: Request for Trial against Decision on Refusal; Patent event code: PJ02012R01D. Patent event date: 20090507; Comment text: Decision to Refuse Application; Patent event code: PJ02011S01I. Appeal kind category: Appeal against decision to decline refusal; Decision date: 20091030; Appeal identifier: 2009101006397; Request date: 20090707 |
AMND | Amendment | |
PB0901 | Examination by re-examination before a trial | Comment text: Amendment to Specification, etc.; Patent event date: 20090805; Patent event code: PB09011R02I. Comment text: Request for Trial against Decision on Refusal; Patent event date: 20090707; Patent event code: PB09011R01I |
E801 | Decision on dismissal of amendment | |
PE0801 | Dismissal of amendment | Patent event code: PE08012E01D; Comment text: Decision on Dismissal of Amendment; Patent event date: 20090813. Patent event code: PE08011R01I; Comment text: Amendment to Specification, etc.; Patent event date: 20090805 |
B601 | Maintenance of original decision after re-examination before a trial | |
PB0601 | Maintenance of original decision after re-examination before a trial | |
J801 | Dismissal of trial | Free format text: REJECTION OF TRIAL FOR APPEAL AGAINST DECISION TO DECLINE REFUSAL REQUESTED 20090707; Effective date: 20091030 |
PJ0801 | Rejection of trial | Patent event date: 20091030; Patent event code: PJ08011S01D; Comment text: Decision on Dismissal of Request for Trial (Dismissal of Decision); Decision date: 20091030; Appeal kind category: Appeal against decision to decline refusal; Appeal identifier: 2009101006397; Request date: 20090707 |