CN108256284A

CN108256284A - A kind of drug virtual screening method

Info

Publication number: CN108256284A
Application number: CN201810002901.2A
Authority: CN
Inventors: 李家辉; 陈品; 张曦
Original assignee: National Sun Yat Sen University
Current assignee: Sun Yat Sen University; National Sun Yat Sen University
Priority date: 2018-01-02
Filing date: 2018-01-02
Publication date: 2018-07-06

Abstract

The present invention relates to a kind of drug virtual screening method, screening object includes multiple candidate compounds, includes the following steps：S1, database is written into the information of all candidate compound molecules；S2, the corresponding record of one of candidate compound molecule is taken out from database；S3, the record of taking-up is stored in being locally stored for calculate node as input file；S4, screening analysis is carried out to input file, and analysis result is written to being locally stored for calculate node；S5, middle reading is locally stored from calculate node in analysis result, is inserted into database in a manner that one records；S6, another is taken out from database there is no a corresponding record of processed candidate compound molecule, return to step S3, until all processing of the corresponding record of all candidate compound molecules are completed.The present invention has been transferred to the load for being locally stored in database server, reducing meta data server of calculate node by that will bear, and ensure that stable system performance.

Description

A kind of drug virtual screening method

Technical field

The invention belongs to data management fields, and in particular to a kind of drug virtual screening method.

Background technology

Drug virtual screening refers to during medicament research and development, before bioactivity screening is carried out, on computers Prescreening is carried out to compound molecule, to reduce practical screening compounds number, while lead compound is improved and finds efficiency. During virtual screening, screening sequence needs to analyze a candidate compounds up to a million successively, obtains the compound Scoring.Wherein, some screening sequences can be stored in each candidate small molecule in individual file, defeated as one of them Enter；Meanwhile the appraisal result of screening can be also stored in an independent file.Therefore, an each pair of candidate compound point Son is screened, and at least needs to manage two small documents.

Current drug virtual screening mainly carries out in High Performance Computing Cluster and supercomputer, because of screening sequence It is run in calculate node, relevant compound molecule data file need be stored directly in cluster and supercomputer Globally shared storage file system on, just can guarantee that these files are accessed in each selected calculate node, and complete One drug virtual screening flow needs management to be stored in million small documents in globally shared storage file system.

The globally shared storage file system that present cluster and supercomputer use, such as Lustre file system, not The a large amount of small documents of management are good at, even if candidate Medicine small molecule is divided into multiple groups, each group is screened successively, not only The concurrent scale of screening is limited, the same time can only screen one of which, and even if being grouped, and drug is empty Intending the relevant large amount of small documents of screening still can be stored directly on global file system, and the metadata of file system can be caused to take Business device load too high, causes file system performance to decline to a great extent, influences the operation of cluster and supercomputer.

Invention content

The defects of in order to overcome the prior art, the present invention, which provides one kind, can reduce meta data server load, maintainer A kind of drug virtual screening method that performance of uniting is stablized.

For above-mentioned technical problem, the present invention solves in this way：A kind of drug virtual screening method screens object Including multiple candidate compounds, include the following steps：

S1, database is written into the information of all candidate compound molecules；

S2, the corresponding record of one of candidate compound molecule is taken out from database；

S3, the record of taking-up is stored in being locally stored for calculate node as input file；

S4, screening analysis is carried out to input file, and analysis result is written to being locally stored for calculate node；

S5, middle reading is locally stored from calculate node in analysis result, is inserted into database in a manner that one records；

S6, another is taken out from database there is no a corresponding record of processed candidate compound molecule, return to step S3, directly It is completed to all processing of the corresponding record of all candidate compound molecules.

Compared with the prior art, the present invention by way of a record, is written using each candidate compound molecule To database, and in the processing procedure of calculate node, the file of generation is stored in being locally stored for calculate node, avoids Large amount of small documents is preserved in globally shared storage file system, alleviates the burden of meta data server, and phase is locally stored Than being more convenient for extending in meta data server, flexibility is good, does not interfere with High Performance Computing Cluster and supercomputer system also Stability；In addition, analysis result can be specifically inserted into a manner of a field in database in a manner that one records, After these analysis results deposit database, the convenience of these data analysis mining processes can be promoted, such as can be easily Algorithm directly is ranked up to these analysis results, unlike analysis result is first read out just from file in the prior art It can processing.

Further, the step S1 is specially：One is created in the database to include at least index, molecular name and divide Each candidate compound molecule is written to the table or set by the table or set of three fields of minor structure information In.

Compared with the prior art, beneficial effects of the present invention are：By candidate compound molecule and the analysis result to it By the storage of the form of record in the database, file is then converted to when in use to be stored in being locally stored of calculate node, It is stored directly in not as file in the globally shared storage file system of cluster and supercomputer, burden is transferred to The load for being locally stored in database server, reducing meta data server of calculate node, ensure that system performance Stability.

Description of the drawings

Fig. 1 is the flow chart of the method for the present invention.

Specific embodiment

With reference to specific embodiment and attached drawing, the present invention is described in detail.

A kind of drug virtual screening method as shown in Figure 1, screening object includes multiple candidate compounds, including walking as follows Suddenly：

In specific implementation process, step S1 is：One is created in MongoDB databases and includes at least index, molecule name Claim the table or set with three fields of molecular structure information, candidate compound molecule is taken out from ZINC databases, and pass through energy Each candidate compound molecule is written to the table or set by the software of enough read-write MongoDB databases；Step Suddenly S2 is：One of candidate compound is taken out from MongoDB databases by the software that can read and write MongoDB databases The corresponding record of molecule；Step S3 is：The record of taking-up is stored in being locally stored for calculate node as input file In Ramdisk；Step S4 is：Screening software AutodockVina carries out screening analysis, and analysis result is write to input file Enter to calculate node and Ramdisk is locally stored；Step S5 is：The software of MongoDB databases can be read and write by analysis result It reads from being locally stored in Ramdisk for calculate node, is inserted into MongoDB databases in a manner that one records；Step S6 is：Another is taken out from MongoDB databases by the software that can read and write MongoDB databases does not have processed time Select the corresponding record of compound molecule, return to step S3, until all processing of the corresponding record of all candidate compound molecules are completed.

Claims

1. a kind of drug virtual screening method, screening object includes multiple candidate compounds, which is characterized in that including walking as follows Suddenly：

2. a kind of drug virtual screening method according to claim 1, which is characterized in that the step S1 is specially： One is created in database including at least index, the table or set of three fields of molecular name and molecular structure information, it will be each Candidate compound molecule is written to as a record in the table or set.