WO2004008361A1 - 塩基配列関連情報を用いた情報処理システム - Google Patents
塩基配列関連情報を用いた情報処理システム Download PDFInfo
- Publication number
- WO2004008361A1 WO2004008361A1 PCT/JP2003/008971 JP0308971W WO2004008361A1 WO 2004008361 A1 WO2004008361 A1 WO 2004008361A1 JP 0308971 W JP0308971 W JP 0308971W WO 2004008361 A1 WO2004008361 A1 WO 2004008361A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- polymorphism
- base sequence
- semantic
- semantic information
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2299/00—Coordinates from 3D structures of peptides, e.g. proteins or enzymes
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/20—Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
Definitions
- the present invention relates to, for example, an information processing system for providing information via a communication network.
- genomic nucleotide sequences of various organisms including humans are being rapidly determined, and genomic nucleotide sequence information is accumulated in various databases.
- a system is being constructed that allows various research institutions and researchers to use genomic base sequence information stored in a database via an information communication network such as the Internet.
- genomic drug discovery research and analysis of genetic information have been actively conducted using the nucleotide sequences contained in such genomic nucleotide sequence information. Attention has been paid to differences in base sequences.
- differences in the nucleotide sequence between individuals are defined as polymorphisms defined as having a given base difference at a frequency of 1% or more in an individual species, and having a given base difference of 1% in an individual species.
- polymorphisms include single nucleotide polymorphism (SNP; Single Nucleotide Polymorphism), which is a single base difference between individuals, and deletion or insertion of one to several tens of bases (sometimes several thousand bases).
- SNP Single nucleotide polymorphism
- VNTR Very Number of Tandem Repeat
- microsatellite polymorphism replicateated sequence of about 2 to 4 bases
- Such polymorphisms may affect the amino acid sequence of a protein between individuals or the expression efficiency of a given gene between individuals. Due to such effects, for example, it is known that the susceptibility to a predetermined disease differs among individuals, and that the sensitivity to a predetermined drug differs between individuals. However, there is no system that effectively utilizes the difference in nucleotide sequence information between individuals such as polymorphisms and provides useful semantic information to each individual. It is the current situation. Disclosure of the invention
- the present invention can provide useful semantic information for each individual and / or information related to the semantic information by effectively utilizing differences in base sequence information between individuals,
- the goal is to build a highly secure information processing system.
- the information processing method for a nucleotide sequence according to the present invention includes a method for providing semantic information and / or information related to the semantic information, the method for providing the semantic information and And / or information related to the semantic information and position information corresponding to the semantic information and / or the information related to the semantic information, and transmitted.
- the side that sends out the base sequence-related information may be configured such that the side that sends out the base sequence-related information has meaning information that means the sent base sequence-related information, And the position information corresponding to the semantic information and / or the information related to the semantic information.
- the information processing method for a base sequence according to the present invention can be realized as a program that causes a computer including hardware such as a control device, a transmission / reception device, and a storage device to execute each step. Further, the information processing method for a base sequence according to the present invention is realized as a recording medium recording a program for causing a computer including hardware such as a control device, a transmission / reception device, and a storage device to execute each step. You can also.
- the information processing method relating to a base sequence according to the present invention can be realized as an information processing device including hardware such as a control device that executes each step, a transmitting / receiving device, and a storage device.
- This description includes part or all of the contents as disclosed in the description and / or drawings of Japanese Patent Application No. 2002-205550, which is a priority document of the present application.
- FIG. 1 is a schematic configuration diagram schematically showing a configuration of an information processing system to which the present invention is applied.
- FIG. 2 is a schematic configuration diagram schematically showing the configuration of a shared computer.
- FIG. 3 is a configuration diagram showing an example of data recorded in the main DB.
- FIG. 4 is a schematic configuration diagram schematically showing the configuration of a personal computer.
- FIG. 5 is a configuration diagram showing an example of data recorded on a genome-related information recording medium.
- FIG. 6 is a flowchart showing a process performed by a shared computer and a personal computer in a system for providing a morbidity of a predetermined disease.
- FIG. 7 is a continuation of FIG. 6, and is a flow chart showing processing on a shared computer and a personal computer in a system for providing morbidity for a predetermined disease.
- an information processing system for providing a user with a possibility of a predetermined disease will be described.
- a case where the user requests that he / she want to know his / her morbidity with respect to a predetermined disease as the “request for goods and / or services” will be described as an example.
- an information processing system capable of recognizing illegal use and collection of nucleotide sequence-related information will be described, but for convenience of description, a simplified model will be described.
- the “goods and / or services” are not limited to this, and may be, for example, suitable for the constitution of an individual (individual). This includes services such as combined medicines, foods and luxury goods, and information and other information that are suited to the constitution and characteristics of the individual (individual).
- the information processing system includes a communication network 1 such as the Internet, a shared computer 2 connected to the communication network 1, and a plurality of personal computers 3 connected to the communication network 1. To enable data communication between the shared computer 2 and the personal computer 3 via the communication network 1.
- a communication network 1 such as the Internet
- shared computer 2 connected to the communication network 1
- personal computers 3 connected to the communication network 1.
- the shared computer 2 includes, as shown in FIG. 2, a CPU 4 for controlling all operations of the shared computer 2, an input device 5 such as a keyboard and a mouse for inputting information and a program execution instruction, and a display device.
- Display device 6 a memory 7 for storing temporary information and non-rewritable information, etc., a database 8 storing various data, and writing predetermined information to these memories 7 and database 8. It comprises a recording device 9 and a transmitting / receiving device 17 for transmitting / receiving information to / from the personal computer 3 via the communication network 1.
- the memory 7 of the shared computer 2 includes a memory section A 10 and a memory section B 11 for recording different types of information, respectively, and a screen memory 1 2 for recording image data to be displayed on the personal computer 3 and the display device 6, for example. And a processing program 13 for operating the present system.
- the shared computer 2 does not have the screen memory 12 and the processing program 13 etc. in the internal memory 7, but has an external storage device (not shown) connected to the shared computer 2 via the communication network 1. Zu) may be included.
- the database 8 (storage device) in the shared computer 2 includes a main DB 14 in which polymorphism addresses, polymorphism patterns, and semantic information are recorded, and a storage DB that stores information recorded in the memory unit A 10- 15 and a storage DB-B 16 for storing the information recorded in the memory section B 11.
- the main DB 14 records the polymorphism address, a plurality of possible polymorphism patterns at the polymorphism address, and semantic information indicating each of the plurality of polymorphism patterns in association with each other. Have been.
- the main DB 14 may record semantic information indicating a combination of polymorphism patterns at a plurality of polymorphism addresses (for example, a haplotype).
- polymorphism address means at least the position where the polymorphism exists in the nucleotide sequence.
- a polymorphism is, for example, a so-called SNP (si Includes nanonucleotide polymorphism, RFLP (restriction fragment length of polymorphism), YNTR (variable number of tandem repeat) ⁇ microsatellite, etc.
- SNP includes nanonucleotide polymorphism
- RFLP restriction fragment length of polymorphism
- YNTR variable number of tandem repeat
- polymorphism address is meant to include bases that occur only at a frequency of less than 1% in an individual species and base sequence changes that indicate changes in the base sequence. That is, the “polymorphism address” indicates a position indicating a polymorphism or the like by combining numerical values, characters, symbols, and the like.
- the polymorphism address is not particularly limited.
- the polymorphism number can be represented by a combination of a chromosome number, a symbol indicating the gene in which the polymorphism exists, and a numerical value indicating the location of the polymorphism in the gene, or a polymorphism. May be a combination of a symbol indicating a gene in which is present and a numerical value indicating the location of a polymorphism in the gene.
- the polymorphism address may be a polymorphism-specific notation assigned to each polymorphism.
- a polymorphism-specific notation is used as the polymorphism address, the polymorphism address does not directly indicate the position in the nucleotide sequence, but the position can be indirectly known based on the polymorphism-specific notation. Therefore, "polymorphism address" is meant to include notations specific to polymorphism.
- Polymorphism pattern is information on a base sequence that differs between individuals, and includes at least the base or base sequence pattern in the polymorphism. Further, the term “polymorphism pattern” is not limited to polymorphism, but also includes patterns of bases and base sequences that are present at a frequency of less than 1% in an individual species. For example, at a polymorphism address known to take A or G, the “polymorphism pattern” is represented by either “A” or “G”.
- the “polymorphism pattern” may indicate a heterozygote or a homozygote in a homologous chromosome.
- the “polymorphism pattern” can be represented by any of “AA”, “GG”, and “AG”.
- the “polymorphism pattern” does not directly represent a pattern that can be taken at a predetermined polymorphism address, but may be an indirect representation.
- a “polymorphism pattern” is, for example, “allele 1” when taking “A” at a polymorphism address known to take A or G, and “allele 2” when taking “G”. Notation May be.
- the “polymorphism pattern” can be represented by any of “AA”, “GG” and “AG” as described above, for example, when it can be represented by “AA”, it is represented by “HI” and “GG” When it can be expressed, it may be expressed as “” or when expressed as “AG”, it may be expressed as “ ⁇ ”.
- the polymorphism pattern may be encrypted or unencrypted.
- polymorphism pattern examples are numerical values representing the “number of repetitions” when the polymorphism is a microsatellite, and “present / absent” when the polymorphism is a deletion type. It may be represented by a symbol.
- the “polymorphism pattern” at each polymorphism address may be described as, for example, “polymorphism 1”, “polymorphism 2”, or “polymorphism 3” in accordance with predetermined regulations and rules.
- polymorphism 1”, “polymorphism 2”, and “polymorphism 3” can be described in descending order of the possible frequency of “polymorphism pattern”.
- each “polymorphism 1” at each polymorphism address does not necessarily represent the same content. That is, for example, "Polymorphism 1" of one polymorphism address represents " ⁇ ⁇ ", which is the most likely frequency, and another "Polymorphism 1" is the most likely frequency, "GG”. Will be.
- the “semantic information” is information associated with the “polymorphism pattern”, and includes, for example, responsiveness to a drug, side effects to a drug, risk for a disease or a disorder, constitution, nature, constitution, nature, and the like. It refers to various information resulting from differences in "polymorphism patterns", such as lifestyle-based adipices and protein interactions. As the “meaning information”, various information resulting from the difference in the “polymorphism pattern” may be directly represented, or may be indirectly represented by using a symbol or the like indicating the information. “Semantic information” is information of a type that will increase and be corrected as research on genomes and genes progresses, and it is preferable to always keep up with the version. In other words, “semantic information” becomes more accurate by increasing / decreasing the amount of accumulation by updating the database using the results of genomic / gene research.
- the semantic information is stored in the main DB 14 at least as “annotation information for the polymorphism pattern” associated with the predetermined “polymorphism address” and “polymorphism pattern”. Has been recorded.
- the semantic information is associated with “polymorphism classification” and “classification (disease name)” corresponding to a predetermined “polymorphism address”. That is, when the predetermined “polymorphism address” is the predetermined “polymorphism pattern”, it is possible to obtain annotation information (semantic information) indicating the type of the disease name and the likelihood of being affected by the disease.
- the semantic information can be correlated with a combination of each polymorphism pattern corresponding to a plurality of polymorphism addresses (for example, a haplotype). That is, for each combination of polymorphism patterns at a plurality of polymorphism addresses, it is possible to associate annotation information (semantic information) indicating a different morbidity of a predetermined disease. In this case, when a plurality of polymorphism addresses are a combination of predetermined polymorphism patterns, it is possible to obtain annotation information (semantic information) indicating the possibility of illness for a predetermined disease.
- the “public level” determined based on a predetermined standard can be associated with the semantic information.
- the criteria for determining the “disclosure level” consider semantic information, that is, the unexpected disadvantage to an individual by disclosing the likelihood of “classification (disease name)” here. Can be determined.
- the public computer 2 determines the ⁇ disclosure level '' so that semantic information that is not appropriate to be disclosed is not disclosed in view of laws, regulations, own standards of conduct, contracts with users, etc. can do.
- the system does not disclose to the user the annotation information that indicates the likelihood of morbidity that is associated with the “disclosure level,” which means that disclosure is not possible.
- the semantic information includes information about the source of the semantic information.
- the “information source” indicates the source of the semantic information, such as the organization that created the semantic information and the organization that provided the semantic information to the shared computer 2. By associating the “information source” with the semantic information, the credibility of the semantic information can be determined.
- the database 8 in the storage DB-B16, for example, data including base sequence-related information, which is genetic information of a requester who uses the present system, can be recorded.
- the storage DB-A 15 can record data such as, for example, information for identifying a requester who uses the system. In this way, by recording the genetic information of the individual and the information identifying the individual separately in the storage DB-A 15 and the storage DB-B 16, the genetic information of the requester and the requester can be obtained. It is difficult to associate with the data that specifies
- the shared computer 2 is not limited to the one having the database 8 therein, and may have an external database (not shown) connected to the shared computer 2 via the communication network 1.
- the shared computer 2 may have a plurality of databases 8 therein, or may have the internal database 8 and an external database connected to the shared computer 2 via the communication network 1. May be.
- the personal computer 3 includes a CPU 20 for controlling all the operations of the personal computer 3 and an input device 2 such as a keyboard and a mouse for inputting information and a program execution instruction.
- a display device such as a display device 22; a memory 23 for storing temporary information and rewritable information; a reading device 25 for reading data from a genome-related information recording medium 24;
- a transmission / reception device 29 for transmitting / receiving information to / from the shared computer 2 via the communication network 1.
- the personal computer 3 is not limited to a normal computer, but may be in any form such as a mobile phone, a personal mobile terminal, and other mobile communication devices.
- the memory 23 of the personal computer 3 has a memory unit 26 for recording information and the like from the genome-related information recording medium 24, and a processing program 27 for operating the information processing system is recorded.
- the genome-related information recording medium 24 stores the genome-related information 28 of an individual.
- Examples of the genome-related information recording medium 24 include a magnetic recording medium such as a magnetic disk and a magnetic card, an optical recording medium to which a magneto-optical recording method and a phase change recording method are applied, a semiconductor memory, and the like.
- the genome-related information recording medium 24 may be in the form of a card, a disk, a stick, a tape, a drum, or the like. May be used. Further, the genome-related information recording medium 24 may be a recording of the genome-related information 28 of a single individual (individual), but may be a plurality of genome-related information 2 of a plurality of individuals (individuals). 8 may be recorded.
- the genome-related information 28 included in the genome-related information recording medium 24 is at least a “polymorphism address” and a “polymorphism pattern” at a predetermined polymorphism address obtained as a result of analyzing the base sequence of an individual (individual). Means.
- the genome-related information 28 may include various types of information such as preexisting diseases, characteristics, medical record information, and health check results.
- the genome-related information recording medium 24 as the genome-related information 28, for example, as shown in FIG. 5, as the data I, an individual number “Gno. It records personal information such as days, records polymorphism addresses and polymorphism patterns as data II, records existing illness as data III, records characteristics as data IV, and records medical record information as data V.
- the genome-related information 28 consists of data I, data II, data I II, data IV and data V. Data I and data II contain essential information, and data III and data Data IV and Data V consist of additional information.
- the “polymorphism address” corresponding to the position on the base sequence and the “polymorphism pattern” at the polymorphism address are linked and recorded. Further, in the data II, additional information at a predetermined polymorphism address may be recorded as a “comment” linked to the “polymorphism address”. In addition, the entire base sequence of a predetermined individual may be recorded in the data II. Even when the entire base sequence is recorded in Data II, “Polymorphism address” and “Polymorphism pattern” are included in Data II.
- the personal computer 3 and the genome-related information recording medium 2 are identical to the personal computer 3 and the genome-related information recording medium 2
- the genome-related information recording medium includes a memory unit having a processing program
- the personal computer uses the genome-related information recording medium.
- a configuration in which the processing program is operated by being attached may be employed.
- the personal computer can operate in accordance with the processing program recorded in the memory of the genome-related information recording medium.
- the processing program 13 recorded in the memory 7 of the shared computer 2 and the memory of the personal computer 3 The processing program 27 recorded in step 23 performs an information processing operation in accordance with, for example, flowcharts as shown in FIGS.
- the steps described as “(shared)” mean the processing in the shared computer 2
- the steps described as “()” are the processing in the personal computer 3. Meaning.
- each individual possessing the genome-related information recording medium 24 accesses the shared computer 2 through the communication network 1 using the personal computer 3 and records the information in the main DB 14 of the shared computer 2 It is a system that uses the semantic information that has been used.
- the information processing system uses a genome-related information recording medium 24 on which genome-related information 28 of a plurality of persons is recorded, and each individual accesses the genome-related information recording medium 24. It may be.
- An individual who uses this system is, for example, a person who has a genome-related information recording medium 24.
- the individual using the system (hereinafter referred to as “requester”) starts the processing program 27 recorded in the memory 23 at step Al (SA1) and reads the personal computer 3
- the device 25 is driven to access the genome-related information recording medium 24, and “Gno.” Recorded as data I on the genome-related information recording medium 24 is read.
- the read “Gno.” Is stored in memory part 26.
- step A2 based on the screen image displayed on the display device 22 by the processing program 27, information requested by the requester to be provided, for example, "possibility of colon cancer" (Request information) is input to the personal computer 3 and from the personal computer 3 to the shared computer 2 via the communication network 1.
- Request information information requested by the requester to be provided, for example, "possibility of colon cancer"
- Step A3 the shared computer 2 receives “Possibility of colon cancer” and “Gno.”.
- the received “colorectal cancer morbidity” and “Gno.” are stored as request information in the memory unit A10.
- step A4 when the request information is received, the processing program 13 recorded in the memory 7 is started to access the main DB 14.
- the processing program 13 performs processing in the shared computer 2.
- step A5 according to the processing program 13, the “classification (disease name)” recorded in the main DB 14 is searched, and the requested “possibility of colon cancer” ( ) Is extracted.
- step A6 the data associated with “Classification (disease name)” (colorectal cancer) consistent with “possibility of colorectal cancer” from the data recorded in the main DB14 Read the “polymorphism address”.
- the read “polymorphism address” is stored in the memory unit A 10 as position information associated with the request information. That is, the “possibility of colon cancer” and the “polymorphism address” for the predetermined “Gno.” Are recorded in the memory unit A 10.
- step A7 the “Gno.” And “polymorphism address” recorded in the memory section A10 are transmitted to the personal computer 3, and the “polymorphism address” to be transmitted is corresponded.
- Yes Send the instruction information to submit the “polymorphism pattern” to the personal computer 3.
- it may be necessary to order the submission of additional information such as pre-existing illnesses and characteristics as needed.
- step A8 “Gno.”, “Polymorphism address” and instruction information transmitted from shared computer 2 are received.
- the received “Gno.” And “polymorphism address” are recorded in the memory part 26.
- step A9 the data II recorded on the genome-related information recording medium 24 is accessed according to the received instruction information.
- step A10 the data II recorded on the genome-related information recording medium 24 is searched according to the processing program 27, and the polymorphism pattern of the instructed polymorphism address is read. The information is stored in the memory unit 26 in association with the polymorphism pattern. At this time, it is preferable to access data I and confirm whether or not the “Gno.” Received in step A8 is correct.
- step A10 in addition to the polymorphism pattern, additional information recorded in data III, data IV, and data V may be read at the same time, and may be recorded in the memory unit 26 as necessary. .
- Step All the polymorphism pattern associated with the polymorphism address temporarily recorded in the memory unit 26 and the additional information recorded as necessary together with the "Gno.” Output to the shared computer 2 via the communication network 1.
- step A12 the polymorphism pattern associated with the polymorphism address and the necessary The additional information recorded according to is received by the shared computer 2, and the received polymorphism pattern is recorded in the memory unit A10 in association with the polymorphism address.
- step A7 the shared computer 2 sends instruction information for instructing the submission of the “polymorphism pattern”, and in step A10, the personal computer 3 multiplexes according to the instruction information.
- the pattern is read from the genome-related information recording medium 24.
- the present system may be a system that does not transmit the command information in step A7.
- step A10 the personal computer 3 searches the data ⁇ based on the polymorphism address received in step A8 according to the processing program 27, and searches the received polymorphism pattern for the polymorphism pattern. read out. Then, the personal computer 3 outputs the polymorphism pattern and the like to the shared computer 2 in step A11.
- the shared computer 2 obtains the polymorphism pattern of the “polymorphism address” associated with the “classification (disease name)” corresponding to the “possibility of colorectal cancer” in step A12. Can be.
- step A13 the main DB 14 is accessed to search for a received polymorphism address and a pattern that matches the polymorphism pattern.
- the main DB 14 a plurality of polymorphism patterns are recorded for one polymorphism address, and the received polymorphism address and its polymorphism pattern are stored in the main DB 14. Search for a match.
- step A14 the morbidity (semantic information) for colorectal cancer associated with the received polymorphism pattern is read out. That is, in step A14, the requester's morbidity for colorectal cancer can be read out according to the polymorphism address and polymorphism pattern submitted by the requester.
- the read morbidity is stored in the memory unit A10 in association with the requester's "Gno.”, "Polymorphism address" and "polymorphism pattern”.
- the likelihood of colorectal cancer may be stored in a form captured with additional information, or other information obtained from the additional information may be associated with the requester's ⁇ Gno. '' May be stored.
- step A15 the morbidity stored in the memory unit A10 and the polymorphism address used when reading out the morbidity from the main DB 14 are associated with each other.
- the requester requests the personal computer 3 via the communication network 1. Sent with "Gno.” That is, in step A15, the morbidity read out in step A14 and the polymorphism address corresponding to the morbidity are linked and transmitted as a set. In step A15, in addition to "Gno.” And the set of morbidity and polymorphism addresses corresponding to the morbidity, the "information source" recorded in the main DB 14 is added. It may be transmitted in association with the morbidity and a set of polymorphism addresses corresponding to the morbidity.
- the requester By transmitting the information source, the requester evaluates the credibility of the morbidity based on the information source and the polymorphism address corresponding to the morbidity and the polymorphism pattern corresponding to the polymorphism address. can do. That is, in addition to the information source received and the morbidity and the set of polymorphism addresses corresponding to the morbidity, the requester side receives the information transmitted by himself in step A11 in step A19 below. By associating the pattern patterns with each other and outputting them on the display device 22 of the personal computer 3 or on paper, the requester can determine that the organization described in the information source is a polymorphism pattern associated with the polymorphism address.
- the fact that the information about the morbidity has been prepared by using the information can be used to evaluate the reliability of the information about the morbidity. Also, on the information provider side, the requester will provide responsible services because the reliability is evaluated. If the requester has previously confirmed the polymorphism pattern associated with the polymorphism address provided by himself / herself through the informed consent, the informed consent confirmation information and the output information are not included. Whether they match can also be checked visually.
- step A16 the personal computer 3 receives the requester's "Gno. J” and a set of morbidity I "students and a polymorphism address corresponding to the morbidity. .
- the received morbidity and the set of polymorphism addresses corresponding to the morbidity are recorded in the memory section 26.
- step A17 the polymorphism address associated with the polymorphism pattern transmitted to shared computer 2 in step Al1, the morbidity received in step A16, and the It is determined whether or not a polymorphism address included in the set of polymorphism addresses corresponding to the morbidity (hereinafter referred to as “the polymorphism address received in step A16”) matches.
- the polymorphism address transmitted in step A11 and the polymorphism address received in step A16 completely match, it is determined as "yes", and the polymorphism number transmitted in step A11 is determined.
- step A 16 If not, judge as “noj.” If the polymorphism address transmitted in step A11 does not match the polymorphism address received in step A16, for example, transmit in step A11
- the polymorphism address transmitted in step A11 does not match the polymorphism address received in step A16, for example, transmit in step A11
- the notational match of the polymorphism address means that the notation at a predetermined position on the base sequence is the same.
- substantial match of polymorphism addresses means, for example, a case where the same position in the nucleotide sequence is directly or indirectly indicated even if the notation does not match.
- step A17 if the determination is "no”, the flow proceeds to step Al8 (SA18). If the determination is "yes”, the flow proceeds to step Al9 (SA19).
- step A18 all of the polymorphism patterns associated with the polymorphism address transmitted in step A11 in step A17 are used by shared computer 2 to acquire morbidity (semantic information). Since it is determined that this is not the case, a warning to that effect is displayed on the display device 22, and a warning to that effect is transmitted to the shared computer 2. After step A18, the process proceeds to step A19 described later, and the process is continued.
- warning refers to a command based on the judgment that the probability of unauthorized use 'collection is high, a notification that not all of the transmitted polymorphism patterns have been used, or all of the transmitted polymorphism patterns This includes a notice to notify a third party that the was not used.
- step A 19 the possibility of illness for colorectal cancer is displayed on the display device 22 from the semantic information recorded in the memory part 26 according to the processing program 27.
- the requester can obtain the possibility of morbidity for colorectal cancer using the genome-related information 28 recorded on the genome-related information recording medium 24. If “yes” is determined in step A 17, all of the polymorphism patterns associated with the polymorphism address transmitted in step A 1 determine the morbidity (semantic information) on shared computer 2. Perform step A 19 following step A 17 because it was found to be used to obtain.
- the “information related to semantic information” is further derived from the “semantic information” read in step A14, and in step A15, the “information related to semantic information” is derived together with the “semantic information”.
- step A3 in addition to the “possibility of colorectal cancer” desired by the requester, for example, when the possibility of occurrence exceeds Of the functional food that will prevent the disease as additional request information, together with information about the requester's morbidity of colorectal cancer, and the requested function if the morbidity exceeds a predetermined level. It is also possible to provide sex foods.
- Steps A3 to A7 and steps A12 and steps A12 to A15 in shared computer 2 may be performed by different organizations. In this case, the steps in the shared computer 2 are divided into two.
- the semantic information recorded in the main DB 14 is used to convert the polymorphism address by using the genome-related information recording medium 24 in which the individual polymorphism pattern is recorded in association with the polymorphism address. It can be used by individuals with intervention. In other words, individuals using this system do not need to record semantic information on the genome-related information recording medium, but only possess genome-related information 28 that associates polymorphism addresses with polymorphism patterns. Can get various semantic information.
- step A 16 a set of the morbidity and the polymorphism address corresponding to the morbidity is received from the shared computer 2. Then, in step A17, based on the polymorphism address received in step A16, all of the polymorphism patterns associated with the polymorphism address transmitted in step A11 are affected by shared computer 2. Confirmed whether it was used to acquire the possibility (semantic information).
- the requester determines whether the shared computer 2 has used all the polymorphism patterns associated with the polymorphism address transmitted in step A l 1 to acquire semantic information, or whether or not the shared computer 2 has used it. You can check. For example, in the shared computer 2, when the polymorphism pattern associated with the polymorphism address transmitted in step A11 is used for a purpose other than the request of the requester or when it is simply collected, Requesters can find use other than the intended purpose. If the shared computer 2 uses or collects the polymorphism pattern associated with the polymorphism address for a purpose other than the original purpose of presenting the requested morbidity, it should be concealed from the requester. In I can't.
- this system can be a deterrent to illegal use and collection of polymorphism addresses and polymorphism patterns recorded on the genome-related information recording medium 24 in the shared computer 2.
- step A18 all of the polymorphism patterns associated with the polymorphism address transmitted in step A11 were not used by shared computer 2 to acquire semantic information.
- the content of the warning is displayed on the display device 22 and a warning is transmitted to the shared computer 2.
- a warning is sent to shared computer 2 and the third party other than shared computer 2 is notified of the shared computer. It is preferable to disclose information on (2).
- the third party has a database for storing information from the personal computer 3 and a transmitting and receiving device for transmitting and receiving information to and from the personal computer 3 and the shared computer 2 via the communication network 1.
- the third party organization there is an organization that monitors compliance with the rules regarding transmission and reception of genome-related information via the communication network 1.
- the third party organization may be a public organization or a private organization.
- the third party accumulates information about the shared computer 2 received from the personal computer 3 in a database.
- a third-party organization may disclose the accumulated information on the shared computer 2 to the general public through the transmission / reception device, or warn about unauthorized use / collection of the polymorphism pattern associated with the polymorphism address May be transmitted to the shared computer 2.
- a storage medium having only the data I and additionally data III to V, which is obtained by removing the information included in the data II from the genome-related information recording medium may be used.
- the information contained in the data II is recorded in an external database (genome-related information recording medium) connected to the personal computer 3 via the communication network 1.
- an error is sent to an external database via the communication network 1.
- the requester can obtain semantic information as in the flowcharts shown in FIGS.
- the requester has neither the genome-related information recording medium 24 nor the recording medium except for the information included in the data II from the genome-related information recording medium, and the communication network 1 It may be provided with a genomic-related information recording medium 24 connected to the personal computer 3 via a personal computer.
- the requester accesses the genome-related information recording medium 24 via the communication network 1 and obtains the “polymorphism address” and “polymorphism pattern” recorded on the genome-related information recording medium 24.
- the genome-related information recording medium 24 may be a medium in which genome-related information on a plurality of individuals is recorded for each individual (for each “Gno. J”).
- the present invention is not limited to a configuration in which the shared computer 2 has the main DB 14 as described above.
- the main DB 14 connected to the shared computer 2 via the communication network 1
- the shared computer 2 accesses the main DB 14 via the communication network 1 in the flowcharts shown in FIGS.
- the requester can obtain desired semantic information according to the flowcharts shown in FIGS.
- the shared computer 2 accesses a plurality of main DBs 14 of different organizations or organizations via the communication network 1 and uses the semantic information included in the plurality of main DBs 14.
- information can be provided to the requestor. That is, in the information processing system, in step A4 in the flowcharts shown in FIGS. to access.
- the requester can obtain information on the possibility of colorectal cancer based on the information contained in the various main DBs 14.
- the shared computer 2 sends at least a personal computer to a so-called agent.
- the request information received from the third party may be transmitted to obtain the semantic information (in this example, “possibility of colon cancer”) via the agent.
- the polymorphism pattern may be encrypted or unencrypted.
- an encrypted polymorphism pattern may be received from among the encrypted polymorphism patterns associated with the polymorphism address, or may be decrypted from among the encrypted polymorphism patterns and then multiplied.
- the type pattern may be received, or the polymorphic pattern may be received after being encrypted from among the unencrypted polymorphic patterns.
- an encrypted polymorphism pattern may be acquired and a polymorphism pattern may be transmitted in an encrypted state, or an encrypted polymorphism pattern may be acquired and decrypted before the polymorphism pattern is acquired. May be transmitted, or a non-encrypted polymorphism pattern may be acquired and encrypted before being transmitted.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Biophysics (AREA)
- Analytical Chemistry (AREA)
- Bioinformatics & Computational Biology (AREA)
- Chemical & Material Sciences (AREA)
- Evolutionary Biology (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Theoretical Computer Science (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Medical Treatment And Welfare Office Work (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Abstract
Description
Claims
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2003281108A AU2003281108A1 (en) | 2002-07-15 | 2003-07-15 | Information processing system using base sequence relevant information |
US10/521,351 US7747394B2 (en) | 2002-07-15 | 2003-07-15 | Information processing system using base sequence relevant information |
EP03741402A EP1553512A4 (en) | 2002-07-15 | 2003-07-15 | INFORMATION PROCESSING SYSTEM USING INFORMATION RELATING TO BASIC SEQUENCES |
CA002492658A CA2492658A1 (en) | 2002-07-15 | 2003-07-15 | Information processing system using base sequence relevant information |
US12/785,188 US8364416B2 (en) | 2002-07-15 | 2010-05-21 | Information processing system using base sequence relevant information |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2002-205505 | 2002-07-15 | ||
JP2002205505A JP3677258B2 (ja) | 2002-07-15 | 2002-07-15 | 塩基配列関連情報を用いた情報処理システム |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/785,188 Continuation US8364416B2 (en) | 2002-07-15 | 2010-05-21 | Information processing system using base sequence relevant information |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2004008361A1 true WO2004008361A1 (ja) | 2004-01-22 |
Family
ID=30112767
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2003/008971 WO2004008361A1 (ja) | 2002-07-15 | 2003-07-15 | 塩基配列関連情報を用いた情報処理システム |
Country Status (7)
Country | Link |
---|---|
US (2) | US7747394B2 (ja) |
EP (1) | EP1553512A4 (ja) |
JP (1) | JP3677258B2 (ja) |
AU (1) | AU2003281108A1 (ja) |
CA (1) | CA2492658A1 (ja) |
TW (1) | TWI289264B (ja) |
WO (1) | WO2004008361A1 (ja) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2387277C (en) * | 2001-05-25 | 2015-03-03 | Hitachi, Ltd. | Information processing system using nucleotide sequence-related information |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2000067139A (ja) * | 1998-08-25 | 2000-03-03 | Hitachi Ltd | 電子カルテシステム |
Family Cites Families (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5985559A (en) | 1997-04-30 | 1999-11-16 | Health Hero Network | System and method for preventing, diagnosing, and treating genetic and pathogen-caused disease |
US5479513A (en) | 1994-11-18 | 1995-12-26 | Martin Marietta Energy Systems, Inc. | Fast and secure encryption-decryption method based on chaotic dynamics |
US5917913A (en) | 1996-12-04 | 1999-06-29 | Wang; Ynjiun Paul | Portable electronic authorization devices and methods therefor |
US5970500A (en) | 1996-12-12 | 1999-10-19 | Incyte Pharmaceuticals, Inc. | Database and system for determining, storing and displaying gene locus information |
US6537747B1 (en) | 1998-02-03 | 2003-03-25 | Lucent Technologies Inc. | Data transmission using DNA oligomers |
US6304262B1 (en) * | 1998-07-21 | 2001-10-16 | Raytheon Company | Information security analysis system |
EP1208421A4 (en) * | 1999-06-25 | 2004-10-20 | Genaissance Pharmaceuticals | PROCESS FOR MAINTAINING AND USING HAPLOTYPE DATA |
AU1367201A (en) | 1999-10-01 | 2001-05-10 | Orchid Biosciences, Inc. | Method and system for providing genotype clinical information over a computer network |
CA2387780C (en) | 1999-10-15 | 2011-10-11 | W. Jean Dodds | Animal health diagnosis |
GB2363874B (en) * | 1999-11-06 | 2004-08-04 | Dennis Sunga Fernandez | Bioinformatic transaction scheme |
EP1250676A2 (en) | 1999-12-22 | 2002-10-23 | Neurografix | System, method and article of manufacture for managing a medical services network |
US20030208454A1 (en) | 2000-03-16 | 2003-11-06 | Rienhoff Hugh Y. | Method and system for populating a database for further medical characterization |
US6874085B1 (en) | 2000-05-15 | 2005-03-29 | Imedica Corp. | Medical records data security system |
US20020010552A1 (en) | 2000-05-26 | 2002-01-24 | Hugh Rienhoff | System for genetically characterizing an individual for evaluation using genetic and phenotypic variation over a wide area network |
US6300872B1 (en) * | 2000-06-20 | 2001-10-09 | Philips Electronics North America Corp. | Object proximity/security adaptive event detection |
WO2002017190A1 (en) | 2000-08-22 | 2002-02-28 | Varro Technologies, Inc. | Method and system for sharing biological information |
JPWO2002025519A1 (ja) * | 2000-09-20 | 2004-01-29 | 株式会社東芝 | 遺伝子による診療情報提供方法、診療情報提供端末及び診療情報受給端末 |
WO2002025528A1 (en) | 2000-09-21 | 2002-03-28 | Theradoc.Com, Inc. | Systems and methods for manipulating medical data via a decision support system |
US6931604B2 (en) * | 2000-12-18 | 2005-08-16 | Derek Graham Lane | Method of navigating a collection of interconnected nodes |
-
2002
- 2002-07-15 JP JP2002205505A patent/JP3677258B2/ja not_active Expired - Fee Related
-
2003
- 2003-04-03 TW TW092107619A patent/TWI289264B/zh not_active IP Right Cessation
- 2003-07-15 EP EP03741402A patent/EP1553512A4/en not_active Withdrawn
- 2003-07-15 CA CA002492658A patent/CA2492658A1/en not_active Abandoned
- 2003-07-15 WO PCT/JP2003/008971 patent/WO2004008361A1/ja not_active Application Discontinuation
- 2003-07-15 AU AU2003281108A patent/AU2003281108A1/en not_active Abandoned
- 2003-07-15 US US10/521,351 patent/US7747394B2/en not_active Expired - Lifetime
-
2010
- 2010-05-21 US US12/785,188 patent/US8364416B2/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2000067139A (ja) * | 1998-08-25 | 2000-03-03 | Hitachi Ltd | 電子カルテシステム |
Also Published As
Publication number | Publication date |
---|---|
US20100292975A1 (en) | 2010-11-18 |
TWI289264B (en) | 2007-11-01 |
TW200401218A (en) | 2004-01-16 |
JP3677258B2 (ja) | 2005-07-27 |
EP1553512A1 (en) | 2005-07-13 |
US20060147917A1 (en) | 2006-07-06 |
AU2003281108A1 (en) | 2004-02-02 |
CA2492658A1 (en) | 2004-01-22 |
EP1553512A4 (en) | 2006-06-28 |
US7747394B2 (en) | 2010-06-29 |
US8364416B2 (en) | 2013-01-29 |
JP2004046696A (ja) | 2004-02-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
TWI363309B (en) | Genetic analysis systems, methods and on-line portal | |
Madera et al. | The SUPERFAMILY database in 2004: additions and improvements | |
US8571810B2 (en) | Information processing system using nucleotide sequence-related information | |
Schaafsma et al. | V ari SNP, a benchmark database for variations from db SNP | |
US20090240441A1 (en) | System and method for analysis and presentation of genomic data | |
Singh et al. | From the reference human genome to human pangenome: Premise, promise and challenge | |
JP3888999B2 (ja) | 塩基配列関連情報を用いた情報処理システム | |
WO2004008361A1 (ja) | 塩基配列関連情報を用いた情報処理システム | |
Sanchez-Villeda et al. | DNAAlignEditor: DNA alignment editor tool | |
Ouellette | Internet resources for the clinical geneticist | |
US7337071B2 (en) | Information processing system using base sequence-related information | |
JP4365822B2 (ja) | 塩基配列関連情報を用いた情報処理システム | |
JP3677275B2 (ja) | 塩基配列関連情報を用いた情報処理システム | |
Alzu'bi | A personal genomic information analysis and management system for healthcare purposes | |
JP4266008B2 (ja) | 塩基配列関連情報を用いた情報処理システム | |
JP2024080945A (ja) | 意思決定支援システム、方法およびプログラム | |
Xu et al. | SNPTrack TM: an integrated bioinformatics system for genetic association studies | |
WO2004111909A1 (ja) | 塩基配列関連情報を用いた情報処理システム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
ENP | Entry into the national phase |
Ref document number: 2006147917 Country of ref document: US Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 10521351 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2492658 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2003741402 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 2003741402 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 10521351 Country of ref document: US |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 2003741402 Country of ref document: EP |