CN111597161A - Information processing system, information processing method and device - Google Patents

Information processing system, information processing method and device Download PDF

Info

Publication number
CN111597161A
CN111597161A CN202010463344.1A CN202010463344A CN111597161A CN 111597161 A CN111597161 A CN 111597161A CN 202010463344 A CN202010463344 A CN 202010463344A CN 111597161 A CN111597161 A CN 111597161A
Authority
CN
China
Prior art keywords
information
disease
database
global
information processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010463344.1A
Other languages
Chinese (zh)
Inventor
余欢
陈明
苏庆东
李莉
李玉洁
郑琪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Nuohe Zhiyuan Biotechnology Co ltd
Nanjing Novogene Biotechnology Co ltd
Tianjin Novogene Biological Information Technology Co ltd
Tianjin Nuohe Medical Examination Institute Co ltd
Beijing Novogene Technology Co ltd
Original Assignee
Beijing Nuohe Zhiyuan Biotechnology Co ltd
Nanjing Novogene Biotechnology Co ltd
Tianjin Novogene Biological Information Technology Co ltd
Tianjin Nuohe Medical Examination Institute Co ltd
Beijing Novogene Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Nuohe Zhiyuan Biotechnology Co ltd, Nanjing Novogene Biotechnology Co ltd, Tianjin Novogene Biological Information Technology Co ltd, Tianjin Nuohe Medical Examination Institute Co ltd, Beijing Novogene Technology Co ltd filed Critical Beijing Nuohe Zhiyuan Biotechnology Co ltd
Priority to CN202010463344.1A priority Critical patent/CN111597161A/en
Publication of CN111597161A publication Critical patent/CN111597161A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/182Distributed file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/14Details of searching files based on file metadata
    • G06F16/144Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/16File or folder operations, e.g. details of user interfaces specifically adapted to file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/50Mutagenesis
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H15/00ICT specially adapted for medical reports, e.g. generation or transmission thereof

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Medical Informatics (AREA)
  • Library & Information Science (AREA)
  • Analytical Chemistry (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Epidemiology (AREA)
  • Artificial Intelligence (AREA)
  • Primary Health Care (AREA)
  • Public Health (AREA)
  • Human Computer Interaction (AREA)
  • Chemical & Material Sciences (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Biophysics (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses an information processing system, an information processing method and an information processing device. Wherein, this information processing system includes: the data processing module is used for acquiring local disease information from a first database and storing the local disease information into a second database, wherein the number of the first databases is multiple, and different information of diseases is stored in each first database; the data query module is used for searching the second database based on the disease search term to obtain global disease information; the language type conversion module is used for calling the translation plug-in to carry out language type conversion on the global disease information to obtain the global target disease information; and the information display module is used for displaying the global information of the target disease. The invention solves the technical problems that in the related technology, databases related to disease research are dispersed, and most of the languages of the databases are English, so that better query experience cannot be provided for users.

Description

Information processing system, information processing method and device
Technical Field
The present invention relates to the field of computer technologies, and in particular, to an information processing system, an information processing method, and an information processing apparatus.
Background
With the popularization of second-generation sequencing, while a large amount of human genome data is generated, the progress of disease research is promoted, part of difficult and complicated diseases are solved, and part of genetic patients are benefited. However, it is a difficult problem for researchers, genetic consultants or ordinary people how to apply and mine such information. The research of genetic diseases often depends on various databases such as OMIM, HGMD, Pubmed and the like to a great extent, and at present, a comprehensive database introduced from genetic diseases, related gene description, mutation description and application is not available, and a language-friendly comprehensive database is not available. These existing individual databases are often distributed, diversified and weighted, single database information is not comprehensive, and almost all databases of this type are english, and other limitations, which raise the threshold for using these data. The data management, viewing, utilization and interpretation of the data bring great difficulty for researchers or genetic consultants, and the data information understanding of ordinary people is hindered. For example, a whole genome sequencing sample can generate 300-400 ten thousand SNP mutation data, although professional researchers can use special tools or software to perform mining and primary screening on the mutation data, and obtain dozens of hundreds of candidate results. However, it is difficult to interpret disease information and candidate mutation information in detail later on with some tools. Researchers or genetic consultants need to query a large amount of databases and documents and arrange various data to read subsequent results, the stage is usually the most time-consuming and tedious stage, and the inconvenience of information acquisition brings difficulty to the work, so that the application threshold is improved.
In addition, the existing databases related to disease research are dispersed and are all specific language versions, so that updating is not timely, and the method is inconvenient for researchers, genetic consultants and common people to inquire and utilize.
Aiming at the problems that in the related technology, databases related to disease research are dispersed, and most of the languages of the databases are English, so that better query experience cannot be provided for users, an effective solution is not provided at present.
Disclosure of Invention
The embodiment of the invention provides an information processing system, an information processing method and an information processing device, which are used for at least solving the technical problems that in the related technology, databases related to disease research are dispersed, most of the languages of the databases are English, and better query experience cannot be provided for a user.
According to an aspect of an embodiment of the present invention, there is provided an information processing system including: the data processing module is used for acquiring local disease information from a first database and storing the local disease information into a second database, wherein the number of the first databases is multiple, and different information of diseases is stored in each first database; the data query module is used for searching the second database based on the disease search term to obtain global disease information; the language type conversion module is used for calling the translation plug-in to carry out language type conversion on the global disease information to obtain global target disease information; and the information display module is used for displaying the global information of the target disease.
Optionally, the first database comprises: an online human mendelian genetic database, a human gene mutation database, and other life science domain databases; the second database is a distributed file storage database.
Optionally, the information processing system further comprises: a visualization processing module for generating a predetermined chart based on the characteristics of the target disease global information, wherein the predetermined chart comprises at least one of: and generating a disease variation graph based on the target disease global information, and a distribution state graph of the disease in the population.
Optionally, the information processing system further comprises: a report generation module, configured to generate a disease query report based on the disease global information or the target disease global information, where the disease query report includes at least one of: the probability that the predetermined subject has the disease, and the site where the predetermined subject has the disease.
Optionally, the information processing system further comprises: and the data input module is used for inputting the disease search word.
Optionally, the data processing module is further configured to perform predetermined processing on the local disease information acquired from the first database, and store the processed local disease information in the second database, where the predetermined processing includes at least one of: filtering operation, classifying operation and annotating operation.
Optionally, the data processing module includes: and the verification submodule is used for verifying the correction information submitted by the data input module so as to obtain the validity of the correction information.
Optionally, the data processing module further includes: and the updating submodule is used for updating the second database by using the correction information under the condition that the validity of the correction information reaches a preset value.
According to another aspect of the embodiments of the present invention, there is provided an information processing method applied to the information processing system described in any one of the above, including: responding to the query request, entering a disease information query task, and acquiring a disease search term carried in the query request; searching a second database to obtain disease global information based on the disease search term, wherein the second database stores local disease information obtained from a first database, the first database is multiple, and different information of diseases is stored in each first database; performing language type conversion on the disease global information to obtain target disease global information; and displaying the global information of the target disease.
Optionally, before the obtaining of the disease search term carried in the query request, the information processing method further includes: acquiring login information input through a data input module of the information processing system; and determining that the login information is successfully verified.
Optionally, before the presenting the global information of the target disease, the information processing method further includes at least one of: generating a disease query report based on the disease global information or the target disease global information, wherein the disease query report includes at least one of: determining the probability that the predetermined subject has the disease, determining the site where the predetermined subject has the disease; generating a predetermined chart based on the disease global information or the target disease global information, wherein the predetermined chart comprises at least one of: and generating a disease variation graph based on the target disease global information, and a distribution state graph of the disease in the population.
Optionally, the information processing method further includes: acquiring correction information input through a data input module of the information processing system; verifying the correction information to obtain the validity of the correction information; and determining that the validity of the correction information reaches a preset value, and updating the second database by using the correction information.
According to another aspect of the embodiments of the present invention, there is also provided an information processing apparatus using the information processing method described in any one of the above, including: the first acquisition unit is used for responding to the query request, entering a disease information query task and acquiring a disease search term carried in the query request; the searching unit is used for searching a second database to obtain global disease information based on the disease search term, wherein the second database stores local disease information obtained from a first database, the number of the first databases is multiple, and different information of diseases is stored in each first database; the conversion unit is used for carrying out language type conversion on the disease global information to obtain target disease global information; and the display unit is used for displaying the global information of the target disease.
Optionally, the information processing apparatus further includes: a second obtaining unit, configured to obtain login information input through a data input module of the information processing system before obtaining the disease search term carried in the query request; and the determining unit is used for determining that the login information is successfully verified.
Optionally, the information processing apparatus further includes at least one of: a first generating unit, configured to generate a disease query report based on the disease global information or the target disease global information before the presenting of the target disease global information, where the disease query report includes at least one of: determining the probability that the predetermined subject has the disease, determining the site where the predetermined subject has the disease; a second generating unit, configured to generate a predetermined chart based on the disease global information or the target disease global information, where the predetermined chart includes at least one of: and generating a disease variation graph based on the target disease global information, and a distribution state graph of the disease in the population.
Optionally, the information processing apparatus further includes: a third acquisition unit configured to acquire correction information input through a data input module of the information processing system; the fourth acquisition unit is used for verifying the correction information to obtain the validity of the correction information; and the updating unit is used for determining that the validity of the correction information reaches a preset value and updating the second database by using the correction information.
According to another aspect of the embodiments of the present invention, there is also provided a storage medium including a stored program, wherein the program executes the information processing method of any one of the above.
According to another aspect of the embodiments of the present invention, there is also provided a processor, configured to execute a program, where the program executes to perform the information processing method described in any one of the above.
According to another aspect of the embodiments of the present invention, there is also provided an information processing apparatus including: a memory, a processor coupled with the memory, the memory and the processor communicating over a bus system; the memory is used for storing a program, wherein the program controls the equipment where the memory is located to execute any one of the information processing methods when being executed by the processor; the processor is configured to execute a program, where the program executes the information processing method according to any one of the above descriptions.
In the embodiment of the invention, a data processing module is adopted and is used for acquiring local disease information from a first database and storing the local disease information into a second database, wherein the number of the first databases is multiple, and different information of diseases is stored in each first database; the data query module is used for searching the second database based on the disease search term to obtain global disease information; the language type conversion module is used for calling the translation plug-in to carry out language type conversion on the global disease information to obtain the global target disease information; the information display module is used for displaying global information of a target disease, and the information processing system provided by the embodiment of the invention realizes the purpose of integrating and summarizing local information of diseases in a plurality of databases and then storing the information into one database so as to facilitate a user to search and obtain relatively complete related information of the disease, achieves the technical effect of improving the experience of the user, simultaneously improves the response speed of the information processing system, reduces the requirements on the user, and further solves the technical problems that related databases for disease research in related technologies are relatively dispersed, and the database language is mostly English, so that better query experience cannot be provided for the user.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the invention without limiting the invention. In the drawings:
FIG. 1 is a flow chart of an information processing method according to an embodiment of the present invention;
FIG. 2 is a block diagram of an information handling system according to an embodiment of the present invention;
FIG. 3 is a flow chart of an information processing method according to an embodiment of the present invention;
fig. 4 is a schematic diagram of an information processing apparatus according to an embodiment of the present invention.
Detailed Description
In order to make the technical solutions of the present invention better understood, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
It should be noted that the terms "first," "second," and the like in the description and claims of the present invention and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the invention described herein are capable of operation in sequences other than those illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
For none of the above comprehensive introductions from genetic diseases, relevant databases based on description, mutation description, and application, much less language-friendly comprehensive databases; the existing single database is often not systematic enough due to the fact that information of the existing single database is often dispersed, and is diversified and has a strong emphasis, single database information is not comprehensive, language types can not be switched, and other limitations exist, so that the threshold of using the data is improved, and great inconvenience is brought to users.
The invention provides a comprehensive and complete database of disease knowledge and information, and simultaneously provides functions of exporting, automatically translating English into Chinese, updating at any time and the like, thereby facilitating scientific researchers, genetic consultants and common people to obtain and utilize the existing scientific achievements and assisting in genetic screening. The following examples are given below.
According to an aspect of an embodiment of the present invention, there is provided an information processing system, and fig. 1 is a schematic diagram of an information processing system according to an embodiment of the present invention, as shown in fig. 1, the information processing system including: a data processing module 11, a data query module 13, a language type conversion module 15 and an information presentation module 17. The information processing system will be explained below.
The data processing module 11 is configured to obtain local disease information from a first database, and store the local disease information in a second database, where the number of the first databases is multiple, and different information about diseases is stored in each of the first databases.
Optionally, the first database is a plurality of databases, and they store data of different diseases, for example, the first database stores disease introduction information of different diseases, the first and second databases store gene description information of different diseases, and the first and third databases store mutation description information of different diseases. The local information of diseases is stored in the first database, namely, the local information is only information of a certain aspect of diseases, and global information which is comprehensive of diseases is not provided.
Optionally, the first database may include: an online human mendelian genetic database, a human gene mutation database, and other life science domain databases; the second database is a distributed file storage database. Wherein, the online human Mendelian genetic database, namely OMIM database, mainly records all known genetic diseases, genetically determined traits and genes thereof; the human gene mutation database, namely, the HGMD database is mainly used for collecting and organizing pathogenic sites which are closely related to human genetic diseases in published documents, and aims to collect pathogenic mutations or related mutations of the published human genetic diseases; other life domain databases, which may be PubMed databases, are mainly databases that provide biomedical article search and summarization, and are freely searched, and the database source is MEDLINE.
Alternatively, the second database may be a distributed file storage database, for example, a MongoDB database.
Optionally, the local disease information represents information describing a certain aspect of a disease; for example, disease introduction information, gene-related information of a disease, mutation information of a disease, and the like.
In an alternative embodiment, the data processing module 11 obtains local disease information from the first database through an interface between the first database and the second database, integrates the obtained local disease information, and transmits the integrated information to the second database, so that the second database stores more complete information about a plurality of different diseases, such as introduction information about diseases, gene-related information about diseases, mutation information about diseases, and the like.
That is, the data processing module 11 may be configured to perform integration and summary processing on the local disease information acquired from the first database to obtain global disease information, and store the global disease information in the second database. Specifically, the data processing module can integrate the information of different aspects of the diseases acquired from the plurality of first databases to obtain more complete and comprehensive information of the diseases, so that a user can obtain more complete information of the diseases to be understood through one-time retrieval.
And the data query module 13 is used for searching the second database to obtain global disease information based on the disease search term.
Optionally, the disease search term may be a term related to a certain disease, which is determined by a user for querying information of the disease, for example, a disease name, a disease abbreviation, and the like may be used as a term for searching information of the disease.
In an alternative embodiment, after the user inputs the disease search term, the data query module may search the second database for more comprehensive information about the disease based on the disease search term without the information processing system invoking multiple databases to obtain information about different aspects of the disease.
And the language type conversion module 15 is used for calling the translation plug-in to perform language type conversion on the global disease information to obtain the global target disease information.
Optionally, the translation plug-in may perform conversion between different languages, and after obtaining the disease global information based on the search of the data query module 13, may perform language type conversion on the disease global information based on the requirement of the user to obtain the target disease global information of the language type that meets the requirement of the user and is convenient for the user to browse.
And the information display module 17 is used for displaying the global information of the target diseases.
As can be seen from the above, in the embodiment of the present invention, the data processing module may be used to obtain local disease information from the first database, and store the local disease information in the second database, where the number of the first databases is multiple, and each of the first databases stores different information about a disease; the data query module can also be used for searching the second database based on the disease search term to obtain global disease information; meanwhile, a language type conversion module can be used for calling a translation plug-in to perform language type conversion on the global disease information to obtain the global target disease information; the information display module is used for displaying the global information of the target disease, so that the local information of the diseases in the databases is integrated and summarized and then is stored in one database, a user can conveniently search and obtain relatively complete related information of the diseases, the technical effect of improving the experience of the user is achieved, meanwhile, the response speed of the information processing system is improved, and the requirements for the user are reduced.
Therefore, the information processing system provided by the embodiment of the invention solves the technical problems that databases related to disease research in the related art are relatively dispersed, and the languages of the databases are mostly English, so that better query experience cannot be provided for users.
In an optional embodiment, the information processing system may further include: a visualization processing module for generating a predetermined chart based on the characteristics of the target disease global information, wherein the predetermined chart comprises at least one of: a disease variation graph generated based on the target disease global information, and a distribution state graph of the disease in the population.
In this embodiment, the predetermined chart may be drawn based on the characteristics of the global information of the target disease (e.g., variation of the disease, distribution of the disease among people, people susceptible to the disease, time of occurrence of the disease, and areas of occurrence of the disease) obtained by query, so that the user can obtain the related information of the disease more intuitively.
In an alternative embodiment, the information processing system further comprises: a report generation module, configured to generate a disease query report based on the disease global information or the target disease global information, where the disease query report includes at least one of: the probability that the predetermined subject has the disease, and the site where the predetermined subject has the disease.
In this embodiment, a disease query report may be generated based on the disease global information or the target disease global information; for example, based on the information input by the user, the information may be compared with the data stored in the second database, so that the probability that the predetermined subject has a disease may be obtained, and the point location from which the predetermined subject is determined to have a disease may also be obtained.
In an alternative embodiment, the information processing system further comprises: and the data input module is used for inputting the disease search terms.
In this embodiment, a data input module may be provided in the information processing system, and a user may input a disease search term through the data input module to perform a query.
In an optional embodiment, the data processing module is further configured to perform predetermined processing on the local disease information acquired from the first database, and store the processed local disease information in the second database, where the predetermined processing includes at least one of: filtering operation, classifying operation and annotating operation.
In this embodiment, since there may be some redundant information, incomplete information items, etc. in the local disease information obtained from the first database, in order to ensure the reliability of the disease information stored in the second database, after obtaining the local disease information from the first database, a predetermined process is performed on the local disease information, for example, the incomplete information items are filtered out, and annotation information is added to the site information and stored in the second database in a good form.
Note information may be added to the bit point information by software such as annovar.
In an alternative embodiment, the data processing module comprises: and the verification submodule is used for verifying the correction information submitted by the data input module so as to obtain the validity of the correction information.
In this embodiment, after the user acquires the global information of the target disease, the validity, authenticity, reliability, and the like of the acquired global information of the target disease can be verified based on self knowledge storage or experience; specifically, when the user thinks that the acquired target disease global information has the problems of validity, authenticity, reliability and the like, the correction information can be submitted through the data input module, and at the moment, the verification sub-module can verify the correction information submitted by the user to obtain the validity of the correction information.
In an optional embodiment, the data processing module further comprises: and the updating submodule is used for updating the second database by using the correction information under the condition that the validity of the correction information reaches a preset value.
In this embodiment, the second database may be updated with the correction information to ensure validity, authenticity, and reliability of the second database.
In addition, the update sub-module may also obtain the latest disease information from another database or a web page in real time to update the second database.
Fig. 2 is a block diagram of an information processing system according to an embodiment of the present invention, which may include, as shown in fig. 2: a front end frame that interacts with a user; and the background processing framework is used for processing data. A user can input information such as a personal account number, a password and the like through a login module in a front-end frame to login into the information processing system and enter into a personal page; then, entering a database website, and starting a background framework of a second database, wherein the second database acquires local disease information from the first database (such as HGMD database data, OMIM database data and Pubmed database data), collects, integrates and stores the local disease information; the background processing framework can also perform logic processing, report generation, drawing and the like so as to meet the data acquisition and display requirements of the front-end framework.
The information processing system mainly comprises four parts, namely a self-writing background code, a server, a database and a front-end display website, wherein the server can select Nginx or Apache, the database can select SQLite or MySQL, a back-end framework can select Django, and a development version can be a 2.2 version; in addition, the information processing system of the embodiment of the invention has no limit to the operating system and can be used in a cross-platform mode.
The back end can acquire knowledge and information of a disease related database through the self-writing codes, store the knowledge and information in MongoDB, realize interaction of front-end webpages through a Django framework, and realize other extended functions through the self-writing software codes, thereby realizing more functions in the future.
By developing software and codes, the information processing system provided by the embodiment of the invention integrates functions of data mining, data correction and updating, automatic language translation and the like into a set of software codes by integrating international universal mutation database, disease knowledge base and automatic translation system related to genetic disease research, forms a set of comprehensive inquiry and display system of genetic disease knowledge and mutation information, can construct an authoritative and perfect comprehensive disease knowledge base, enables non-professional predetermined objects to conveniently and comprehensively inquire and acquire related resources of genetic diseases, reduces the threshold of scientific research students, genetic consultants and common people for analyzing and understanding genetic information, and assists in diagnosis and treatment of genetic diseases. Has the following advantages:
1) a comprehensive and comprehensive disease information knowledge base is constructed: acquiring information of each level related to diseases, including background information, gene information related to diseases, mutation information and the like, for example, integrating information of an authorized Wei database such as an OMIM database, an HGMD database, a Pubmed database and the like, and performing the operation to ensure the comprehensiveness, authority and accuracy of the result;
2) automatic translation of data: because the database and the knowledge base related to the genetic disease research are almost all constructed in English, the system can automatically translate data information after being used, supports the provision of Chinese and English reports, and is convenient for various types of people to use and understand;
3) the operation is simple and the result is clear: the inquiry port of the webpage end inputs information related to the inquiry disease, so that disease description information, related pathogenic gene information, reported pathogenic mutation information and the like of a comprehensive system can be displayed and can be exported;
4) convenient data correction function: although the authority of each database is very strong, the evidence strength of part of articles does not always meet the requirement, the document updating speed is very high, the public database cannot be updated timely, the accuracy and novelty can be judged according to the query result in the process of actually using the system by a predetermined object, the update is submitted at a webpage end, and the corrected data information is automatically updated after the update is applied again later;
5) potential for scalability: many extensible applications may also be generated in the future based on the present invention, such as predefined object entry custom sites, interpretation through a comprehensive database, and reporting.
According to an embodiment of the present invention, there is provided a method embodiment of an information processing method, it should be noted that, when the information processing method is applied to an information processing system of any one of the above, the steps shown in the flowchart of the figure may be executed in a computer system such as a set of computer executable instructions, and although a logical order is shown in the flowchart, in some cases, the steps shown or described may be executed in an order different from that here.
Fig. 3 is a flowchart of an information processing method according to an embodiment of the present invention, as shown in fig. 3, the information processing method including the steps of:
step S302, responding to the query request, entering a disease information query task, and acquiring a disease search term carried in the query request.
Optionally, the query request may be a request triggered by triggering the query control after the user inputs the disease search term through the data input module of the information processing system.
Step S304, searching and obtaining disease global information from a second database based on the disease search term, wherein the second database stores local disease information obtained from the first database, the first database is multiple, and different information of diseases is stored in each first database.
Optionally, the first database is a plurality of databases, and they store data of different diseases, for example, the first database stores disease introduction information of different diseases, the first and second databases store gene description information of different diseases, and the first and third databases store mutation description information of different diseases. The local information of diseases is stored in the first database, namely, the local information is only information of a certain aspect of diseases, and global information which is comprehensive of diseases is not provided.
Optionally, the first database may include: an online human Mendelian genetic database, a human gene mutation database, a life science domain database; the second database is a distributed file storage database. Wherein, the online human Mendelian genetic database, namely OMIM database, mainly records all known genetic diseases, genetically determined traits and genes thereof; the human gene mutation database, namely, the HGMD database is mainly used for collecting and organizing pathogenic sites which are closely related to human genetic diseases in published documents, and aims to collect pathogenic mutations or related mutations of the published human genetic diseases; the life domain database, which may be a PubMed database, is a database that provides biomedical paper search and summarization and is freely searched, and the source of the database is MEDLINE.
Alternatively, the second database may be a distributed file storage database, for example, a MongoDB database.
Optionally, the local disease information represents information describing a certain aspect of a disease; for example, disease introduction information, gene-related information of a disease, mutation information of a disease, and the like.
In an optional embodiment, the data processing module obtains local disease information from the first database through an interface between the first database and the second database, integrates the obtained local disease information, and transmits the integrated information to the second database, so that the second database stores more complete information of a plurality of different diseases, such as introduction information of the diseases, gene-related information of the diseases, mutation information of the diseases, and the like.
That is, the data processing module may be configured to perform integration and summary processing on the local disease information acquired from the first database to obtain global disease information, and store the global disease information in the second database. Specifically, the data processing module can integrate the information of different aspects of the diseases acquired from the plurality of first databases to obtain more complete and comprehensive information of the diseases, so that a user can obtain more complete information of the diseases to be understood through one-time retrieval.
And step S306, performing language type conversion on the disease global information to obtain target disease global information.
Optionally, in the embodiment of the present invention, the translation plug-in may be invoked to translate the disease global information based on a user's requirement, that is, after the disease global information is obtained based on a search of the data query module, the language type of the disease global information may be converted based on the user's requirement, so as to obtain the target disease global information of the language type that meets the user's requirement and is convenient for the user to browse.
And step S308, displaying the global information of the target disease.
As can be seen from the above, in the embodiment of the present invention, a disease information query task may be entered in response to a query request, and a disease search term carried in the query request may be acquired; searching from a second database to obtain disease global information based on the disease search term, wherein the second database stores local disease information obtained from a first database, the first database is multiple, and different information of diseases is stored in each first database; carrying out language type conversion on the global disease information to obtain the global target disease information; the global information of the target disease is displayed, the local information of the diseases in a plurality of databases is integrated and summarized and then is stored in one database, so that a user can conveniently search and obtain relatively complete related information of the diseases, the technical effect of improving the experience of the user is achieved, meanwhile, the response speed of an information processing system is improved, and the requirement on the user is reduced.
Therefore, the information processing method provided by the embodiment of the invention solves the technical problems that databases related to disease research in the related art are relatively dispersed, and the languages of the databases are mostly English, so that better query experience cannot be provided for users.
In an optional embodiment, before obtaining the disease search term carried in the query request, the information processing method may further include: acquiring login information input through a data input module of the information processing system; and determining that the login information is successfully verified.
In an optional embodiment, before presenting the global information of the target disease, the information processing method may further include at least one of: generating a disease query report based on the disease global information or the target disease global information, wherein the disease query report includes at least one of: determining the probability that the predetermined subject has the disease, determining the site where the predetermined subject has the disease; generating a predetermined chart based on the disease global information or the target disease global information, wherein the predetermined chart comprises at least one of: a disease variation graph generated based on the target disease global information, and a distribution state graph of the disease in the population.
In this embodiment, a disease query report may be generated based on the disease global information or the target disease global information; for example, based on the information input by the user, the information may be compared with the data stored in the second database, so that the probability that the predetermined subject has a disease may be obtained, and the point location from which the predetermined subject is determined to have a disease may also be obtained.
In addition, in this embodiment, the predetermined chart may be drawn based on the characteristics of the global information of the target disease (e.g., variation of the disease, distribution of the disease among people, people susceptible to the disease, time when the disease is frequently occurring, and areas where the disease is frequently occurring), so that the user can obtain the information related to the disease more intuitively.
In an optional embodiment, the information processing method may further include: acquiring correction information input through a data input module of the information processing system; verifying the correction information to obtain the validity of the correction information; and determining that the validity of the correction information reaches a preset value, and updating the second database by using the correction information.
In this embodiment, after the user acquires the global information of the target disease, the validity, authenticity, reliability, and the like of the acquired global information of the target disease can be verified based on self knowledge storage or experience; specifically, when the user thinks that the acquired target disease global information has the problems of validity, authenticity, reliability and the like, the correction information can be submitted through the data input module, and at the moment, the correction information submitted by the user can be verified through the verification sub-module so as to obtain the validity of the correction information.
In this embodiment, the second database may be updated with the correction information to ensure validity, authenticity, and reliability of the second database.
In addition, the second database may be updated by acquiring the latest disease information from another database, a web page, or the like in real time.
In the embodiment of the invention, various databases related to the genetic disease can be integrated through software codes, comprehensive information such as genetic disease background knowledge, related mutant genes, mutant sites and the like is covered, an integration system of genetic disease knowledge and information is formed, automatic Chinese display can be realized, inquiry and processing of professional and non-professional data processing personnel on genetic data are realized, the threshold for acquiring and utilizing scientific research results by scientific researchers, genetic consultants and common people is lowered, and genetic screening is assisted.
In addition, in the embodiment of the invention, self-writing software or processing data can be utilized, specifically, Python self-writing software codes can be utilized to obtain data of various databases from the internet, and the data are arranged and translated and then stored in a MongoDB database to form a comprehensive disease knowledge and information database; deployment of cloud servers and installation of applications: and installing a dependence application on the cloud server, building a database website, and building a dependence Django framework on the database website. Under the Django framework, several application modules are formed through self-writing software, a diseasejsearch application module is used for inquiring information of a database, a diseasejreport application module is used for generating a related report, and other applications also include chromosome _ plot and diseasedb _ api which are respectively used for drawing and interacting with the database. The predetermined object can then access the homepage through the browser.
In the embodiment of the invention, the self-writing software is used for acquiring and translating the basic information, the description information, the gene information related to diseases, the specific related mutation information and other information of diseases from the public databases such as OMIM, HGMD and Pubmed, and the like, so as to form a comprehensive disease knowledge base from the basic information of the diseases to the information for explaining the pathogenic mutation of the diseases. The PHP is then used at the front end to build up a web page for presentation and management. Then, the predetermined object searches related diseases and phenotypes in a constructed database by using a character string matching algorithm based on the disease name input by the webpage end, can simultaneously acquire comprehensive information such as genes and loci related to the diseases or phenotypes, and can perform derivation and related extended application.
The data of the software is mainly sourced from international universal disease databases such as OMIM, HGMD, Pubmed and the like, the data types are information such as disease basic information, disease and gene, or disease, gene and locus association and the like, the data are subjected to careful filtering and annotation before being stored, items with incomplete information are filtered, annotation information is added to the locus information by using Annovar software, and the data are stored in the MongoDB database in a good form.
In order to realize data query, a set of interface software special for querying and updating disease gene associated data is compiled, the interface software can run on a Linux system, is connected with a MongoDB database, and queries data through a pymongo module. After the predetermined object inputs the name of the disease to be searched, the closest name of the disease or phenotype is found by means of a character string matching algorithm provided by a fuzzy wuzzy module, and after the predetermined object selects the needed phenotype, all database records related to the phenotype are extracted and fed back to the predetermined object.
In order to ensure that the content of the database is advanced with time, on one hand, the software has the function of updating the database, on the other hand, the predetermined object can also feed back new data, the modification of the data is submitted through the form of the front-end page, the modified content is returned to the background and finally fed back and recorded in the database, the information of the submitter and the submission time are included, and the modified content is recorded in the database.
The acquisition of the relevant information of the literature is derived from the literature information of a pre-collected NCBI Pubmed database, meanwhile, the contents such as the abstract and the like are translated by using Google translation, the translated contents are stored in a MongoDB database, and the related literature is extracted every query.
The display and the function of the front-end page are mainly realized by depending on Javascript + HTML + CSS, and relate to JQuery, datacles, Bootstrap libraries and the like. Due to the fact that the displayed content is large in number, in order to provide good browsing experience for the preset object, asynchronous requests are conducted on the displayed content in a blocking mode, and rapid page loading and function separation are guaranteed.
The information processing method provided by the embodiment of the invention is more comprehensive and comprehensive compared with an OMIM database, and covers complete and comprehensive key information required by genetic disease research, such as basic disease information, gene mutation information, site mutation information and the like; compared with an OMIM database, the method is more friendly, can be provided with Chinese and English double versions, and can also be provided with more other versions based on requirements; compared with an OMIM database, the method is more convenient to use, and the export and update of a webpage end can be realized; all information of an OMIM database is covered; the method has better expandability, can improve more knowledge information in the future, increases functions such as custom input of disease names and mutation information, automatic interpretation by using the knowledge base generated by the method, report issuing and the like.
According to another aspect of an embodiment of the present invention, there is also provided an information processing apparatus using the information processing method of any one of the above, and fig. 4 is a schematic diagram of the information processing apparatus according to an embodiment of the present invention, as shown in fig. 4, the information processing apparatus may include: a first obtaining unit 41, a searching unit 43, a converting unit 45 and a presenting unit 47. The information processing apparatus will be described in detail below.
The first obtaining unit 41 is configured to enter a disease information query task in response to the query request, and obtain a disease search term carried in the query request.
The searching unit 43 is configured to search the second database for global disease information based on the disease search term, where the second data stores local disease information obtained from the first databases, the number of the first databases is multiple, and different information about diseases is stored in each of the first databases.
And the conversion unit 45 is used for performing language type conversion on the disease global information to obtain target disease global information.
And the display unit 47 is used for displaying the global information of the target diseases.
It should be noted here that the first obtaining unit 41, the searching unit 43, the converting unit 45 and the presenting unit 47 correspond to steps S302 to S308 in the embodiment, and the above units are the same as the examples and application scenarios realized by the corresponding steps, but are not limited to the disclosure of the above embodiment. It should be noted that the above-described elements as part of an apparatus may be implemented in a computer system, such as a set of computer-executable instructions.
As can be seen from the above, in the above embodiment of the present application, the first obtaining unit may be used to respond to the query request, enter a disease information query task, and obtain the disease search term carried in the query request; then searching a second database by using a searching unit based on the disease search word to obtain global disease information, wherein the second database stores local disease information obtained from the first database, the first databases are multiple, and different information of diseases is stored in each first database; performing language type conversion on the disease global information by using a conversion unit to obtain target disease global information; and displaying the global information of the target disease by using the display unit. By the information processing device provided by the embodiment of the invention, the purpose of integrating and summarizing the local disease information in a plurality of databases and storing the information in one database is realized, so that a user can search and obtain relatively complete disease related information conveniently, the technical effect of improving the experience of the user is achieved, the response speed of an information processing system is improved, the requirement on the user is reduced, and the technical problems that in the related technology, the databases related to disease research are relatively dispersed, and the database language is mostly English, so that better query experience cannot be provided for the user are solved.
In an optional embodiment, the information processing apparatus further includes: the second acquisition unit is used for acquiring login information input by a data input module of the information processing system before acquiring the disease search term carried in the query request; and the determining unit is used for determining that the login information is successfully verified.
In an optional embodiment, the information processing apparatus further comprises at least one of: the first generating unit is used for generating a disease query report based on the disease global information or the target disease global information before the target disease global information is displayed, wherein the disease query report comprises at least one of the following: determining the probability that the predetermined subject has the disease, determining the site where the predetermined subject has the disease; a second generating unit, configured to generate a predetermined chart based on the disease global information or the target disease global information, where the predetermined chart includes at least one of: a disease variation graph generated based on the target disease global information, and a distribution state graph of the disease in the population.
In an optional embodiment, the information processing apparatus further includes: a third acquisition unit configured to acquire correction information input through a data input module of the information processing system; the fourth acquisition unit is used for verifying the correction information to obtain the validity of the correction information; and the updating unit is used for determining that the validity of the correction information reaches a preset value and updating the second database by using the correction information.
According to another aspect of the embodiments of the present invention, there is also provided a storage medium including a stored program, wherein the program executes the information processing method of any one of the above.
According to another aspect of the embodiments of the present invention, there is also provided a processor, configured to execute a program, where the program executes an information processing method according to any one of the foregoing methods.
According to another aspect of the embodiments of the present invention, there is also provided an information processing apparatus including: a memory, a processor coupled to the memory, the memory and the processor communicating via a bus system; the memory is used for storing a program, wherein the program controls the equipment where the memory is located to execute any one of the information processing methods when being executed by the processor; the processor is used for running the program, wherein the program executes the information processing method of any one of the above items when running.
The above-mentioned serial numbers of the embodiments of the present invention are merely for description and do not represent the merits of the embodiments.
In the above embodiments of the present invention, the descriptions of the respective embodiments have respective emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.
In the embodiments provided in the present application, it should be understood that the disclosed technology can be implemented in other ways. The above-described embodiments of the apparatus are merely illustrative, and for example, the division of the units may be a logical division, and in actual implementation, there may be another division, for example, multiple units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, units or modules, and may be in an electrical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic or optical disk, and other various media capable of storing program codes.
The foregoing is only a preferred embodiment of the present invention, and it should be noted that, for those skilled in the art, various modifications and decorations can be made without departing from the principle of the present invention, and these modifications and decorations should also be regarded as the protection scope of the present invention.

Claims (16)

1. An information processing system, comprising:
the data processing module is used for acquiring local disease information from a first database and storing the local disease information into a second database, wherein the number of the first databases is multiple, and different information of diseases is stored in each first database;
the data query module is used for searching the second database based on the disease search term to obtain global disease information;
the language type conversion module is used for calling the translation plug-in to carry out language type conversion on the global disease information to obtain global target disease information;
and the information display module is used for displaying the global information of the target disease.
2. The information handling system of claim 1, wherein the first database comprises: an online human Mendelian genetic database, a human gene mutation database, a life science domain database; the second database is a distributed file storage database.
3. The information processing system according to claim 1, further comprising: a visualization processing module for generating a predetermined chart based on the characteristics of the target disease global information, wherein the predetermined chart comprises at least one of: and generating a disease variation graph based on the target disease global information, and a distribution state graph of the disease in the population.
4. The information processing system according to claim 1, further comprising: a report generation module, configured to generate a disease query report based on the disease global information or the target disease global information, where the disease query report includes at least one of: the probability that the predetermined subject has the disease, and the site where the predetermined subject has the disease.
5. The information processing system according to claim 1, further comprising: and the data input module is used for inputting the disease search word.
6. The information processing system of claim 1, wherein the data processing module is further configured to perform a predetermined process on the local disease information obtained from the first database, and store the processed local disease information in the second database, wherein the predetermined process includes at least one of: filtering operation, classifying operation and annotating operation.
7. The information processing system of claim 5, wherein the data processing module comprises: and the verification submodule is used for verifying the correction information submitted by the data input module so as to obtain the validity of the correction information.
8. The information processing system of claim 7, wherein the data processing module further comprises: and the updating submodule is used for updating the second database by using the correction information under the condition that the validity of the correction information reaches a preset value.
9. An information processing method applied to the information processing system according to any one of claims 1 to 8, comprising:
responding to the query request, entering a disease information query task, and acquiring a disease search term carried in the query request;
searching a second database to obtain disease global information based on the disease search term, wherein the second database stores local disease information obtained from a first database, the first database is multiple, and different information of diseases is stored in each first database;
performing language type conversion on the disease global information to obtain target disease global information;
and displaying the global information of the target disease.
10. The method of claim 9, wherein prior to obtaining the disease term carried in the query request, the method further comprises:
acquiring login information input through a data input module of the information processing system;
and determining that the login information is successfully verified.
11. The method of claim 9, wherein prior to said presenting said target disease global information, said method further comprises at least one of:
generating a disease query report based on the disease global information or the target disease global information, wherein the disease query report includes at least one of: determining the probability that the predetermined subject has the disease, determining the site where the predetermined subject has the disease;
generating a predetermined chart based on the disease global information or the target disease global information, wherein the predetermined chart comprises at least one of: and generating a disease variation graph based on the target disease global information, and a distribution state graph of the disease in the population.
12. The method of claim 9, further comprising:
acquiring correction information input through a data input module of the information processing system;
verifying the correction information to obtain the validity of the correction information;
and determining that the validity of the correction information reaches a preset value, and updating the second database by using the correction information.
13. An information processing apparatus using the information processing method according to any one of claims 9 to 12, comprising:
the first acquisition unit is used for responding to the query request, entering a disease information query task and acquiring a disease search term carried in the query request;
the searching unit is used for searching a second database to obtain global disease information based on the disease search term, wherein the second database stores local disease information obtained from a first database, the number of the first databases is multiple, and different information of diseases is stored in each first database;
the conversion unit is used for carrying out language type conversion on the disease global information to obtain target disease global information;
and the display unit is used for displaying the global information of the target disease.
14. A storage medium characterized by comprising a stored program, wherein the program executes the information processing method of any one of claims 9 to 12.
15. A processor, configured to execute a program, wherein the program executes to perform the information processing method according to any one of claims 9 to 12.
16. An information processing apparatus characterized by comprising:
a memory, a processor coupled with the memory, the memory and the processor communicating over a bus system;
the memory is used for storing a program, wherein the program controls the device where the memory is located to execute the information processing method according to any one of claims 9 to 12 when being executed by the processor;
the processor is configured to execute a program, wherein the program executes the information processing method according to any one of claims 9 to 12.
CN202010463344.1A 2020-05-27 2020-05-27 Information processing system, information processing method and device Pending CN111597161A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010463344.1A CN111597161A (en) 2020-05-27 2020-05-27 Information processing system, information processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010463344.1A CN111597161A (en) 2020-05-27 2020-05-27 Information processing system, information processing method and device

Publications (1)

Publication Number Publication Date
CN111597161A true CN111597161A (en) 2020-08-28

Family

ID=72187934

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010463344.1A Pending CN111597161A (en) 2020-05-27 2020-05-27 Information processing system, information processing method and device

Country Status (1)

Country Link
CN (1) CN111597161A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112489729A (en) * 2020-12-04 2021-03-12 北京诺禾致源科技股份有限公司 Gene data query method and device and nonvolatile storage medium

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101989297A (en) * 2009-07-30 2011-03-23 陈越 System for excavating medicine related with disease gene in computer
CN103294682A (en) * 2012-02-24 2013-09-11 摩根全球购物有限公司 Multi-language retrieving method, computer readable storage medium and network searching system
US8655645B1 (en) * 2011-05-10 2014-02-18 Google Inc. Systems and methods for translation of application metadata
CN105229649A (en) * 2013-03-15 2016-01-06 百世嘉(上海)医疗技术有限公司 For the human genome analysis of variance of disease association and the system and method for report
CN105740243A (en) * 2014-12-08 2016-07-06 深圳华大基因研究院 Method and device for constructing biological information database
CN107169310A (en) * 2017-03-20 2017-09-15 上海基银生物科技有限公司 A kind of genetic test construction of knowledge base method and system
CN109036572A (en) * 2018-06-29 2018-12-18 迈凯基因科技有限公司 A kind of multiple database exchange method and device
CN109086570A (en) * 2018-06-29 2018-12-25 迈凯基因科技有限公司 A kind of multiple database successively exchange method and device
CN110570905A (en) * 2019-07-22 2019-12-13 中国人民解放军总医院 method and device for constructing omics data analysis platform and computer equipment

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101989297A (en) * 2009-07-30 2011-03-23 陈越 System for excavating medicine related with disease gene in computer
US8655645B1 (en) * 2011-05-10 2014-02-18 Google Inc. Systems and methods for translation of application metadata
CN103294682A (en) * 2012-02-24 2013-09-11 摩根全球购物有限公司 Multi-language retrieving method, computer readable storage medium and network searching system
CN105229649A (en) * 2013-03-15 2016-01-06 百世嘉(上海)医疗技术有限公司 For the human genome analysis of variance of disease association and the system and method for report
CN105740243A (en) * 2014-12-08 2016-07-06 深圳华大基因研究院 Method and device for constructing biological information database
CN107169310A (en) * 2017-03-20 2017-09-15 上海基银生物科技有限公司 A kind of genetic test construction of knowledge base method and system
CN109036572A (en) * 2018-06-29 2018-12-18 迈凯基因科技有限公司 A kind of multiple database exchange method and device
CN109086570A (en) * 2018-06-29 2018-12-25 迈凯基因科技有限公司 A kind of multiple database successively exchange method and device
CN110570905A (en) * 2019-07-22 2019-12-13 中国人民解放军总医院 method and device for constructing omics data analysis platform and computer equipment

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112489729A (en) * 2020-12-04 2021-03-12 北京诺禾致源科技股份有限公司 Gene data query method and device and nonvolatile storage medium

Similar Documents

Publication Publication Date Title
Mazandu et al. Gene ontology semantic similarity tools: survey on features and challenges for biological knowledge discovery
US20190220490A1 (en) Combining website characteristics in an automatically generated website
AU2009238294B2 (en) Data transformation based on a technical design document
US20090210787A1 (en) Document data managing method, managing system, and computer software
US20050131649A1 (en) Advanced databasing system for chemical, molecular and cellular biology
US20090007271A1 (en) Identifying attributes of aggregated data
JP2011501847A (en) Computer-implemented method
US20210004586A1 (en) Methods, systems, and storage media for automatically identifying relevant chemical compounds in patent documents
JP2011513810A (en) Term identification method and apparatus
US20150106701A1 (en) Input support method and information processing system
CN111581349A (en) Semantic retrieval method and system for patent literature
Bailey et al. GAIA: framework annotation of genomic sequence
Bessant et al. Building Bioinformatics Solutions
CN111597161A (en) Information processing system, information processing method and device
Li et al. Building an efficient curation workflow for the Arabidopsis literature corpus
CN111158973B (en) Web application dynamic evolution monitoring method
JP2021144348A (en) Information processing device and information processing method
Pavesi et al. Using Weeder for the discovery of conserved transcription factor binding sites
JP2006023968A (en) Unique expression extracting method and device and program to be used for the same
Chatr-Aryamontri et al. Searching the protein interaction space through the MINT database
JP4398461B2 (en) Literature retrieval device, literature retrieval method, and literature retrieval program
CN113377367B (en) Data collection method, device, computer equipment and storage medium
Bedi et al. Development of Biological Databases for Genomic Research
JP4886014B2 (en) Literature retrieval device, literature retrieval method, and literature retrieval program
Ozyer et al. KoExPubMed: A Tool for Effective and Customized Knowledge Extraction from PubMed

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination