CN109872780A - A kind of determination method and device of chemical synthesis route - Google Patents

A kind of determination method and device of chemical synthesis route Download PDF

Info

Publication number
CN109872780A
CN109872780A CN201910194016.3A CN201910194016A CN109872780A CN 109872780 A CN109872780 A CN 109872780A CN 201910194016 A CN201910194016 A CN 201910194016A CN 109872780 A CN109872780 A CN 109872780A
Authority
CN
China
Prior art keywords
target compound
compound
chemical synthesis
response rule
disaggregated model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910194016.3A
Other languages
Chinese (zh)
Inventor
杨阳
刘海宾
张北辰
郭良越
冯佳欣
王春兰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Deep System Yao Technology Co Ltd
Original Assignee
Beijing Deep System Yao Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Deep System Yao Technology Co Ltd filed Critical Beijing Deep System Yao Technology Co Ltd
Priority to CN201910194016.3A priority Critical patent/CN109872780A/en
Publication of CN109872780A publication Critical patent/CN109872780A/en
Pending legal-status Critical Current

Links

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

This application discloses a kind of determination methods of chemical synthesis route, comprising: receives chemical synthesis request, the chemical synthesis request is used for the chemical synthesis route of request target compound;The target compound is input to disaggregated model, obtains response rule corresponding with the target compound;The disaggregated model is with compound for input, and take response rule as the machine learning model of output;The target compound is carried out according to the response rule to decompose raw material needed for retrosynthetic analysis obtains synthesising target compound;Chemical synthesis route corresponding with the target compound is determined according to the raw material.This method realizes chemical synthesis route Automated Design, and disaggregated model accuracy with higher and stronger generalization ability by disaggregated model, can be applied to new reaction, without training again, so improves chemical synthesis route design efficiency.Disclosed herein as well is the training method of disaggregated model, device, equipment and media.

Description

A kind of determination method and device of chemical synthesis route
Technical field
This application involves the determination method and devices of chemical field more particularly to a kind of chemical synthesis route.
Background technique
With the rapid development of organic chemistry, the research for synthesizing chemistry is constantly broken through, and the design of synthetic route is changed in chemistry Work, pharmacy, the status in the multiple fields such as material become most important.One good synthetic route, can not only allow chemical work Make personnel can quick synthesising target compound, and bigger profit can be brought for industrial sector.
Currently, most of chemical synthesis route design is to rely on the knowledge and experience of chemist itself.And with New reaction, new reagent continue to bring out, and that many chemists often can not consider many factors in design route is thorough, Such as yield height, how much is synthesis step, safety and the feature of environmental protection etc..
Related software on the market can only generally provide the design of single step reaction when designing synthetic route, although can be with The working efficiency of chemists is promoted, but still to carry out screening step by step according to artificial experience.Up to now, not yet A full automatic complete synthetic route design software moves towards market.
Certainly, industry has had the method for Part Full Automatic complete synthetic route design to be suggested, including is based on similitude The chemical synthesis route design method of retrieval, the chemical synthesis route design method based on chemical reaction rule either are based on covering The chemical synthesis route method of special Carlow tree search.Wherein, the chemical synthesis route design method based on similarity retrieval can only Compound with similar structure is designed, covers not comprehensively, there is biggish limitation;Based on chemical reaction rule Chemical synthesis route design method is inconsiderate complete, thus accuracy is lower;Chemical synthesis route based on Monte Carlo tree is set Meter method calculating speed is extremely slow, and time-consuming.
Therefore it provides the chemical synthesis route of a kind of high efficiency, high accuracy determines that method becomes urgent problem to be solved.
Summary of the invention
In view of this, this application provides a kind of determination method of chemical synthesis route, this method is by machine learning application It is designed in retrosynthetic analysis and chemical synthesis route, the train classification models by way of machine learning, for anti-to chemistry It answers rule to be predicted, there is stronger accuracy and preferable Generalization Capability, and can be realized full automatic chemical synthesis Highway route design.Accordingly, present invention also provides a kind of training method of disaggregated model, the determining device of chemical synthesis route, Training device, equipment, storage medium and the computer program product of disaggregated model.
The application first aspect provides a kind of determination method of chemical synthesis route, which comprises
Chemical synthesis request is received, the chemical synthesis request is used for the chemical synthesis route of request target compound;
The target compound is input to disaggregated model, obtains response rule corresponding with the target compound;Institute Stating disaggregated model is with compound for input, and take response rule as the machine learning model of output;
Decomposition retrosynthetic analysis is carried out to the target compound according to the response rule to obtain synthesizing the targeted Raw material needed for closing object;
Chemical synthesis route corresponding with the target compound is determined according to the raw material.
Optionally, it is described according to the response rule to the target compound carry out decompose retrosynthetic analysis synthesized Raw material needed for the target compound includes:
Decomposition retrosynthetic analysis is carried out to the target compound according to the response rule and obtains decomposition product;
Whether search the decomposition product from raw database with the determination decomposition product is raw material;
Decomposition product is not raw material if it exists, then using the decomposition product as target compound, executes the target compound It is input to disaggregated model, obtains response rule corresponding with the target compound, and according to the response rule to the mesh The step of mark compound carries out decomposition retrosynthetic analysis, until the decomposition product is raw material.
Optionally, the method also includes:
It receives and the amendment that the chemical synthesis route is modified is requested;
It is requested in response to the amendment, corrects the chemical synthesis route.
Optionally, the method also includes:
The target compound is converted into corresponding vector using term vector model;
It is then described the target compound is input to disaggregated model to include:
The corresponding vector of the target compound is input to disaggregated model.
Optionally, the machine learning model includes gradient decline decision-tree model, Random Forest model, support vector machines Any one or more in model and neural network model.
The application second aspect provides a kind of training method of disaggregated model, which comprises
Training sample is obtained, the training sample includes compound and reaction information relevant to the compound, institute Reaction information is stated including at least response rule;
According to the training sample, using machine learning algorithm train classification models, until meeting training termination condition;
Wherein, the disaggregated model is input with compound, is output with response rule.
Optionally, response rule obtains in the following way in the training sample:
Acquire chemical equation;
Atomic series mark is carried out to the chemical equation, and based in the chemical equation abstraction reaction after mark The heart;
The reaction center is filtered and clustering processing, response rule is determined according to processing result.
Optionally, the compound in the training sample exists in the form of vectors, and the corresponding vector of the compound passes through As under type obtains:
The chemical formula of compound each in the chemical equation of acquisition is converted into the minor structure that radius is 1 using root algorithm is rubbed Sequence;
All minor structure sequences are converted into the corresponding vector of the minor structure sequence using term vector model;
For each compound, the corresponding vector of minor structure sequence of the compound is summed up to obtain the chemical combination The corresponding vector of object.
Optionally, the machine learning model includes gradient decline decision Tree algorithms, random forests algorithm, support vector machines Any one or more in algorithm and neural network algorithm.
The application third aspect provides a kind of determining device of chemical synthesis route, and described device includes:
Receiving module, for receiving chemical synthesis request, the chemical synthesis request is used for the change of request target compound Learn synthetic route;
Module is obtained, for the target compound to be input to disaggregated model, is obtained corresponding with the target compound Response rule;The disaggregated model is with compound for input, and take response rule as the machine learning model of output;
Decomposing module is closed for carrying out decomposition retrosynthetic analysis to the target compound according to the response rule At raw material needed for the target compound;
Determining module, for determining chemical synthesis route corresponding with the target compound according to the raw material.
Optionally, the decomposing module is specifically used for;
Decomposition retrosynthetic analysis is carried out to the target compound according to the response rule and obtains decomposition product;
Whether search the decomposition product from raw database with the determination decomposition product is raw material;
Decomposition product is not raw material if it exists, then using the decomposition product as target compound, executes the target compound It is input to disaggregated model, obtains response rule corresponding with the target compound, and according to the response rule to the mesh The step of mark compound carries out decomposition retrosynthetic analysis, until the decomposition product is raw material.
Optionally, the receiving module is also used to:
It receives and the amendment that the chemical synthesis route is modified is requested;
Described device further include:
Correction module corrects the chemical synthesis route for requesting in response to the amendment.
Optionally, described device further include:
Conversion module, for the target compound to be converted to corresponding vector using term vector model;
Then the acquisition module is specifically used for when target compound is input to disaggregated model:
The corresponding vector of the target compound is input to disaggregated model.
Optionally, the machine learning model includes gradient decline decision-tree model, Random Forest model, support vector machines Any one or more in model and neural network model.
The application fourth aspect provides a kind of training device of disaggregated model, and described device includes:
Module is obtained, for obtaining training sample, the training sample includes compound and related to the compound Reaction information, the reaction information include at least response rule;
Training module, for using machine learning algorithm train classification models, being instructed until meeting according to the training sample Practice termination condition;
Wherein, the disaggregated model is input with compound, is output with response rule.
Optionally, described device further include:
Acquisition module, for acquiring chemical equation;
Abstraction module, for carrying out atomic series mark to the chemical equation, and based on the chemical reaction after mark Formula abstraction reaction center;
Processing module determines reaction rule according to processing result for being filtered to the reaction center and clustering processing Then.
Optionally, the compound in the training sample exists in the form of vectors, described device further include:
First conversion module, for being converted the chemical formula of compound each in the chemical equation of acquisition using root algorithm is rubbed The minor structure sequence for being 1 for radius;
Second conversion module, for all minor structure sequences to be converted to the minor structure sequence pair using term vector model The vector answered;
Module is summed it up, for being directed to each compound, the corresponding vector of minor structure sequence of the compound is added Vector corresponding with the compound is obtained.
Optionally, the machine learning model includes gradient decline decision Tree algorithms, random forests algorithm, support vector machines Any one or more in algorithm and neural network algorithm.
The 5th aspect of the application provides a kind of equipment, and the equipment includes processor and memory:
Said program code is transferred to the processor for storing program code by the memory;
The processor is closed for the chemistry according to instruction execution the application first aspect in said program code The training method of disaggregated model described in determination method or the application second aspect at route.
The 5th aspect of the application provides a kind of computer readable storage medium, and the computer readable storage medium is used for Store program code, said program code be used for execute chemical synthesis route described in the application first aspect determination method or The training method of the described in any item disaggregated models of person's the application second aspect.
As can be seen from the above technical solutions, the embodiment of the present application has the advantage that
It is for target compound to be synthesized, its is defeated this application provides a kind of determination method of chemical synthesis route Enter to the disaggregated model by machine learning algorithm training, it is output with response rule which, which is input with compound, In this way, response rule corresponding with target compound can be obtained, which can characterize to form the target compound Chemical reaction carries out decomposition retrosynthetic analysis to target compound based on the response rule and obtains original needed for synthesising target compound Material is then based on the raw material and determines chemical synthesis route corresponding with target compound.It is designed with traditional chemical synthesis route It compares, this method train classification models by way of machine learning, to be used for the corresponding response rule of predictive compound, passes through Decomposition retrosynthetic analysis is carried out to classifying rules, to realize chemical synthesis route Automated Design, and the disaggregated model has Higher accuracy and stronger generalization ability, can be applied to new reaction, without training again, so improve chemical conjunction At highway route design efficiency.
Detailed description of the invention
In order to illustrate the technical solutions in the embodiments of the present application or in the prior art more clearly, to embodiment or will show below There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this Some embodiments of application without any creative labor, may be used also for those of ordinary skill in the art To obtain other drawings based on these drawings.
Fig. 1 is the scene framework figure of the determination method of chemical synthesis route in the embodiment of the present application;
Fig. 2 is the flow chart of the determination method of chemical synthesis route in the embodiment of the present application;
Fig. 3 is the exemplary diagram that chemical synthesis route is determined in the embodiment of the present application;
Fig. 4 is the scene framework figure of the training method of disaggregated model in the embodiment of the present application;
Fig. 5 is the flow chart of the training method of disaggregated model in the embodiment of the present application;
Fig. 6 is the schematic diagram that the minor structure sequence of compound is determined in the embodiment of the present application;
Fig. 7 A, Fig. 7 B, Fig. 7 C are by the determination method of the chemical synthesis route of the embodiment of the present application and by tradition side The schematic diagram of method progress chemical synthesis route design;
Fig. 8 is a structural schematic diagram of the determining device of chemical synthesis route in the embodiment of the present application;
Fig. 9 is a structural schematic diagram of the training device of disaggregated model in the embodiment of the present application;
Figure 10 is a structural schematic diagram of server in the embodiment of the present application;
Figure 11 is a structural schematic diagram of terminal in the embodiment of the present application.
Specific embodiment
In order to make those skilled in the art more fully understand application scheme, below in conjunction in the embodiment of the present application Attached drawing, the technical scheme in the embodiment of the application is clearly and completely described, it is clear that described embodiment is only this Apply for a part of the embodiment, instead of all the embodiments.Based on the embodiment in the application, those of ordinary skill in the art exist Every other embodiment obtained under the premise of creative work is not made, shall fall in the protection scope of this application.
The description and claims of this application and term " first ", " second ", " third ", " in above-mentioned attached drawing The (if present)s such as four " are to be used to distinguish similar objects, without being used to describe a particular order or precedence order.It should manage The data that solution uses in this way are interchangeable under appropriate circumstances, so that embodiments herein described herein for example can be to remove Sequence other than those of illustrating or describe herein is implemented.In addition, term " includes " and " having " and theirs is any Deformation, it is intended that cover it is non-exclusive include, for example, containing the process, method of a series of steps or units, system, production Product or equipment those of are not necessarily limited to be clearly listed step or unit, but may include be not clearly listed or for this A little process, methods, the other step or units of product or equipment inherently.
It can only be to similar structure for the chemical synthesis route design method in the prior art based on similarity retrieval Compound be designed, based on chemical reaction rule chemical synthesis route design method it is inconsiderate entirely cause accuracy compared with It is low;The technical issues of chemical synthesis route design method calculating speed based on Monte Carlo tree is extremely slow, time-consuming, the application mentions A kind of chemical synthesis route based on machine learning has been supplied to determine method.
This method trains obtained disaggregated model to predict the corresponding response rule of compound using machine learning, base Decomposition retrosynthetic analysis is carried out in the response rule, realizes full automatic complete synthetic route design.Also, this method is used Disaggregated model be to be obtained by machine learning training, accuracy with higher and generalization ability, and it can be applied In new reaction, and it is not limited to the compound of similar structure, without training again, improves chemical synthesis route design effect Rate.
It is appreciated that the determination method of chemical synthesis route provided by the present application can be applied to processing equipment, the processing Equipment can be any equipment with data-handling capacity, including terminal or server.In practical application, processing equipment can Be it is independent, be also possible to multiple equipment and cooperate the cluster to be formed.The determination method of chemical synthesis route can be to apply journey The form of sequence is stored in processing equipment, and processing equipment realizes chemical synthesis route design by executing the application program.
In order to enable the scheme of the application it is clearer, it can be readily appreciated that below in conjunction with concrete scene to chemical synthesis road The determination method of line is introduced.Scene framework figure shown in Figure 1 includes terminal 10 and server 20 in the scene, uses The operation for determining chemical synthesis route is triggered at family by terminal 10, and terminal 10 generates chemical synthesis request in response to user's operation, And chemical synthesis request is sent to server 20, chemical synthesis request is used for the chemical synthesis route of request target compound, Server 20 in response to the chemical synthesis request, the target compound is input to disaggregated model, the disaggregated model be with Compound is input, take response rule as the machine learning model of output, server 20 can obtain corresponding with target compound Response rule, then according to the response rule to target compound carry out decompose retrosynthetic analysis obtain synthesizing the targeted Raw material needed for closing object, determines the corresponding chemical synthesis route of target compound according to the raw material, so realizes that complete chemistry closes At route Automated Design.
Next, by being carried out from determination method of the angle of server to chemical synthesis route provided by the embodiments of the present application It is discussed in detail.
The determination method of chemical synthesis route shown in Figure 2, this method comprises:
S201: chemical synthesis request is received.
The chemical synthesis request is used for the chemical synthesis route of request target compound.It is carried in chemical synthesis request There is the mark of target compound, such as the chemical formula of target compound, can so indicate that server is generated for target compound Chemical synthesis route.
In practical application, user can trigger the operation for determining the chemical synthesis route of target compound by terminal, Terminal in response to aforesaid operations, request, and sends the chemical synthesis to server by the chemical synthesis generated for target compound Request.
S202: being input to disaggregated model for the target compound, obtains reaction rule corresponding with the target compound Then.
The disaggregated model is with compound for input, and take response rule as the machine learning model of output.Server The mark that target compound is chemically extracted in synthesis request, is input to disaggregated model for target compound, thus acquisition and mesh Mark the corresponding response rule of compound.
In view of the data-handling capacity of disaggregated model, in practical application, the compound for being input to disaggregated model is general Exist in the form of vectors.Based on this, target compound is input to disaggregated model and is specifically as follows by server, and targeted is combined into Corresponding vector is converted to, the corresponding vector of the target compound is then input to disaggregated model.
As a kind of possible implementation, server can realize compound vector by term vector model word2vec Change.Specifically, the chemical formula of compound is converted to the minor structure sequence that radius is 1 by the server by utilizing root algorithm that rubs, then sharp All minor structure sequences are converted into the corresponding vector of the minor structure sequence with word2vec, then to the minor structure sequence pair The vector answered sums up to obtain the corresponding vector of compound.
Compound structure is converted into n-dimensional vector by word2vec model, and n is positive integer.It will be exported from the root algorithm that rubs The minor structure of compound be considered as " word ", be considered as " sentence " for compound, be a kind of unsupervised method, be initially not mark Training in the data of note, to obtain the feature vector of minor structure, these feature vectors can be generalized composite vector.Wherein, When carrying out vectorization to compound, word2vec model can use Skip-gram mode, window size window size Parameter can be set to 5, and size insertion dimensional embeddings can be set to 300, in practical application, above-mentioned Parameter can also be selected or be adjusted according to actual needs.It so, it is possible training and obtain the vector of all minor structures, Mei Gehua The vector for closing object directly can be summed it up to obtain by its all minor structure vector.
In the present embodiment, server is response rule prediction to be realized based on disaggregated model, and then realize chemical synthesis road Line design, therefore, the accuracy of disaggregated model and generalization ability are particularly important.Specifically, server, which uses, passes through engineering Algorithm train classification models are practised, the machine learning algorithm includes gradient decline decision Tree algorithms (Gradient Boosting Decision Tree, GBDT), random forests algorithm, in algorithm of support vector machine and neural network algorithm any one or it is more Kind namely the disaggregated model can be in GBDT model, Random Forest model, supporting vector machine model and neural network model Any one or more.The training process of disaggregated model will be introduced in greater detail below, and it will not be described in detail here.
S203: decomposition retrosynthetic analysis is carried out to the target compound according to the response rule and obtains synthesizing the mesh Raw material needed for marking compound.
Specifically, the chemical reaction of response rule characterization, server can use the response rule and carry out to target compound It decomposes retrosynthetic analysis and obtains decomposition product, which is the reactant for forming target compound.It should be noted that service Device carries out decomposing at least one available decomposition product of retrosynthetic analysis, for each decomposition product, server can by The mode that the decomposition product is searched in raw database determines whether the decomposition product is raw material.
Wherein, raw material refers to the material for further processing, and in the present embodiment, raw material is the material that can be directly obtained Material, such as can be by being obtained from nature, or the material that can be obtained by way of purchase, be also possible to user from The material of definition.Server safeguards there is raw database in advance, is stored with resourcing manifest in the raw database, in this way, service Device can determine whether decomposition product is raw material by the resourcing manifest of inquiry raw database storage.
In some possible implementations, synthesis targeted is can be obtained by the primary retrosynthetic analysis that decomposes in server Raw material needed for closing object namely server can be looked by the primary retrosynthetic analysis gained decomposition product that decomposes in raw database It finds, decomposition product is raw material.In other possible implementations, server is obtained by repeatedly decomposing retrosynthetic analysis To raw material needed for synthesising target compound.In other words, server decompose obtained by decomposition retrosynthetic analysis to target compound In the presence of the decomposition product for not being raw material in object, in this way, server can be executed using the decomposition product as goal decomposition object by the mesh Mark compound is input to disaggregated model, obtains response rule corresponding with the target compound, and according to the response rule The step of decomposition retrosynthetic analysis is carried out to the target compound, until the decomposition product is raw material, namely service Device is using the decomposition product as goal decomposition object, return step S202.
In specific implementation, server can use Open-Source Tools and realize to the decomposition retrosynthetic analysis of target compound, Decomposition product corresponding with target compound is obtained, and determines whether the decomposition product is former by searching for the mode of raw database Material.For example, server can use the RunReactants method in RDkit lib, it can obtain the target compound pair The decomposition product answered.
S204: chemical synthesis route corresponding with the target compound is determined according to the raw material.
After determining raw material needed for synthesising target compound, server is according to the raw material and decomposes the anti-of retrosynthetic analysis Chemical synthesis route corresponding with target compound is determined to path.When the raw material is obtained by once decomposing retrosynthetic analysis When, then server can be designed based on the raw material include a chemical reaction chemical synthesis path with synthesising target compound, When the raw material is obtained by repeatedly decomposing retrosynthetic analysis, then server can be based on the inverse of the sequencing for determining raw material Sequence, design include the chemical synthesis path of multiple chemical reactions with synthesising target compound.
In order to make it easy to understand, illustrating below with reference to specific example.
In one example (referring to the left side Fig. 3), the corresponding raw material of target compound C is A and B, wherein A and B is based on one It is secondary decomposing retrosynthetic analysis and determination, in this way, it is A and B in the first specified requirements that server, which can design chemical synthesis path, I.e. 1. lower reaction generates C to condition, and wherein first specified requirements includes the ratio of A and B, temperature, pH value, catalyst etc..
In another example (referring to the right side Fig. 3), the corresponding raw material of target compound G includes A, B, D and F, wherein A, B, D and F is the determination by multiple retrosynthetic analysis, determines raw material F and intermediate E according to target compound G first, secondly Determine raw material D and intermediate C according to intermediate E, raw material A and B then determined according to intermediate C, in this way, server according to It determines the backward of raw material sequencing, determines that chemistry synthesis path is, A and B are in the 2. lower reaction life of the second specified requirements, that is, condition At intermediate C, intermediate C and raw material D, in third specified requirements, that is, condition, 3. lower reaction generates intermediate E, intermediate E and raw material F generates target compound G four specified requirements, that is, condition is 4. lower, wherein the second specified requirements, third specified requirements and the Four specified requirements include but is not limited to ratio, reaction temperature, pH value of reactant etc..
It is appreciated that the method for determining chemical synthesis route provided by the embodiments of the present application is in addition to that can automatically generate chemistry Outside synthetic route, manual intervention can also be received, generate corresponding chemical synthesis route according to actual needs.Specifically, it services Device can also receive the amendment request being modified to the chemical synthesis route, request then in response to the amendment, amendment The chemical synthesis route.
From the foregoing, it will be observed that the embodiment of the present application provides a kind of method of determining chemical synthesis route, for mesh to be synthesized Compound is marked, the disaggregated model by machine learning algorithm training is input to, which is input with compound, with Response rule is output, in this way, response rule corresponding with target compound can be obtained, which can characterize to be formed The chemical reaction of the target compound carries out decomposition retrosynthetic analysis to target compound based on the response rule and obtains synthesizing this Raw material needed for target compound is then based on the raw material and determines chemical synthesis route corresponding with target compound.With it is traditional Chemical synthesis route design is compared, this method train classification models by way of machine learning, to be used for predictive compound pair The response rule answered, by carrying out decomposition retrosynthetic analysis to classifying rules, thus realize chemical synthesis route Automated Design, and And disaggregated model accuracy with higher and stronger generalization ability, it can be applied to new reaction, without training again, So improve chemical synthesis route design efficiency.
The determination method of chemical synthesis route provided by the embodiments of the present application is realized based on disaggregated model, this is based on, The embodiment of the present application also provides a kind of training method of disaggregated model, this method can be using arbitrarily with data-handling capacity Processing equipment, including terminal or server.
Below by taking server as an example, in conjunction with concrete scene being introduced to the training method of the disaggregated model.Referring to fig. 4 Shown in disaggregated model training method scene framework figure, include server 410 in the scene, server 410 is from network Chemical equation is acquired, the chemical equation based on acquisition generates training sample, then utilizes machine according to the training sample Algorithm train classification models, until meeting training termination condition.
In order to enable the technical solution of the application it is clearer, it can be readily appreciated that below from the angle of server to classification mould The training method of type describes in detail.
Referring to Fig. 5, the training method of disaggregated model provided by the embodiments of the present application includes:
S501: training sample is obtained.
The training sample includes compound and reaction information relevant to the compound, and the reaction information is at least Including response rule.In practical application, server can crawl chemistry instead by crawling tool such as crawler from internet Formula is answered, response rule and compound are chemically extracted in reaction equation, to generate training sample.
The embodiment of the present application provides a kind of implementation for extracting response rule, and specifically, collection of server chemistry is anti- Formula is answered, atomic series mark then is carried out to the chemical equation, and based in the chemical equation abstraction reaction after mark The heart determines response rule according to processing result followed by being filtered to the reaction center and clustering processing.In this way, service Device can generate training sample based on the response rule.
Wherein, reaction center refers to participation reaction and on the influential atom of reaction and group.In organic chemical reactions In, there is in a compound the case where there are multiple identical groups, therefore often for the position of clearly specific reactive group, It needs to carry out the atom and group that atomic series mark (atom-atom mapping) carrys out orienting response to chemical equation, so Reaction center extraction is carried out based on the chemical equation after mark afterwards.Since many organic chemical reactions have similar reaction class Type, therefore obtained reaction center needs are filtered and clustering processing, determine response rule according to processing result, it is anti-based on this Answer rule that can form response rule data set.
In practical application, the compound in the training sample exists in the form of vectors, the compound it is corresponding to Amount can obtain in the following way: server by utilizing rubs root algorithm for the chemical formula of compound each in the chemical equation of acquisition The minor structure sequence (referring to Fig. 6) that radius is 1 is converted to, is for certain organic compound, respectively centered on each atom, is found Stain part indicates central atom in minor structure, Fig. 6, and virtual coil is possible be comprising the atom of only minor structure, in this way, can To obtain the minor structure sequence that radius is 1, then, server by utilizing term vector model is converted to all minor structure sequences described Then the corresponding vector of minor structure sequence is directed to each compound, to the corresponding vector of minor structure sequence of the compound into Row adduction obtains the corresponding vector of the compound.
In the present embodiment, after collecting chemical equation, it is contemplated that a kind of compound can there are many expression way, The molecular formula of compound can be normalized.For example, same molecular formula may be expressed as CN2C (=O) N (C) C (=O) C1 =C2N=CN1C or CN1C=NC2=C1C (=O) N (C) C (=O)-N2C can be expressed as following standard after normalization Literary style Cn1cnc2c1c (=O) n (C) c (=O) n2C can substantially reduce data redudancy by so handling, and improve model Training effectiveness and category of model accuracy.
S502: according to the training sample, using machine learning algorithm train classification models, until meeting training terminates item Part.
Wherein, the disaggregated model is input with compound, is output with response rule.In train classification models, clothes Training sample can be grouped by business device at random according to preset ratio, form training set and test set, the centrally stored trained number of training According to test data in test set.Wherein, preset ratio can be arranged according to actual needs, as an example, preset ratio It can be 7:3.
Then, the training data in server by utilizing training set, in conjunction with machine learning algorithm train classification models, specially The classifying quality of disaggregated model is continued to optimize by adjusting parameter, wherein classifying quality can pass through Receiver Operating Characteristics' song Area (the Area surrounded under line (receiver operating characteristic curve, ROC) with reference axis Under Curve, AUC) it is characterized, AUC refers at random when selecting a positive sample and negative sample, current sorting algorithm root Positive sample is come to the probability of negative sample signature according to the score value being calculated, it is clear that AUC value is bigger, and current sorting algorithm is got over It is possible that positive sample is come before negative sample, can preferably classify.When server train classification models meet training knot When beam condition, disaggregated model at this time can be used for execute classification task.Wherein, which can be according to reality Demand and be arranged, such as can be set to the loss function of disaggregated model and be in the loss letter of convergence state or disaggregated model Number is less than preset threshold.
In some possible implementations, the machine learning algorithm includes gradient decline decision Tree algorithms, random gloomy Any one or more in woods algorithm, algorithm of support vector machine and neural network algorithm.In order to make it easy to understand, being declined with gradient For decision Tree algorithms, disaggregated model is established using the method for GBDT, on the training data by adjusting the quantity of regression tree, every The parameters such as depth, loss function, the learning rate of a lone tree continue to optimize more classifying quality AUC, to obtain preferable mould Type.
In addition, server can also carry out the feelings such as over-fitting to trained disaggregated model using the test data in test set Condition verifying, if over-fitting occurs in disaggregated model, further adjusts parameter, to obtain optimal models.
From the foregoing, it will be observed that the embodiment of the present application provides a kind of training method of disaggregated model, this method is to include compound And the training sample that response rule corresponding with compound is formed in interior information, using machine learning algorithm to disaggregated model into Row training can be used for response rule prediction, compound is input to this when disaggregated model meets training termination condition Disaggregated model, the disaggregated model, that is, corresponding response rule of exportable generation chemical combination.Pass through the disaggregated model of this method training Can the corresponding response rule of predictive compound by carrying out decomposition retrosynthetic analysis to classifying rules chemical conjunction may be implemented At route Automated Design, and disaggregated model accuracy with higher and stronger generalization ability, it can be applied to new Reaction, without training again, so improves chemical synthesis route design efficiency.
It should be noted that the embodiment of the present application also utilize the embodiment of the present application training disaggregated model and traditional change It learns synthetic route design method to test (Fig. 7 A, Fig. 7 B and Fig. 7 C) several unknown compounds, base in the embodiment of the present application It is significantly shorter than the chemical synthesis route that traditional approach is designed in the chemical synthesis route of disaggregated model design, conjunction is greatly lowered At the process complexity of target compound.
The above are the determination method of chemical synthesis route provided by the embodiments of the present application, the training method of disaggregated model one A little specific implementations are based on this, and the embodiment of the present application also provides corresponding devices, below will be from the angle of function modoularization Above-mentioned apparatus is introduced.
The structural schematic diagram of the determining device of chemical synthesis route shown in Figure 8, the device 800 include:
Receiving module 810, for receiving chemical synthesis request, the chemical synthesis request is for request target compound Chemical synthesis route;
Module 820 is obtained, for the target compound to be input to disaggregated model, is obtained and the target compound pair The response rule answered;The disaggregated model is with compound for input, and take response rule as the machine learning model of output;
Decomposing module 830 is obtained for carrying out decomposition retrosynthetic analysis to the target compound according to the response rule The raw material to needed for synthesizing the target compound;
Determining module 840, for determining chemical synthesis route corresponding with the target compound according to the raw material.
Optionally, the decomposing module 830 is specifically used for:
Decomposition retrosynthetic analysis is carried out to the target compound according to the response rule and obtains decomposition product;
Whether search the decomposition product from raw database with the determination decomposition product is raw material;
Decomposition product is not raw material if it exists, then using the decomposition product as target compound, executes the target compound It is input to disaggregated model, obtains response rule corresponding with the target compound, and according to the response rule to the mesh The step of mark compound carries out decomposition retrosynthetic analysis, until the decomposition product is raw material.
Optionally, the receiving module 810 is also used to:
It receives and the amendment that the chemical synthesis route is modified is requested;
Described device 800 further include:
Correction module corrects the chemical synthesis route for requesting in response to the amendment.
Optionally, described device 800 further include:
Conversion module, for the target compound to be converted to corresponding vector using term vector model;
Then the acquisition module 820 is specifically used for when target compound is input to disaggregated model:
The corresponding vector of the target compound is input to disaggregated model.
Optionally, the machine learning model includes gradient decline decision-tree model, Random Forest model, support vector machines Any one or more in model and neural network model.
The embodiment of the present application also provides a kind of training device of disaggregated model, structural schematic diagram shown in Figure 9 should Device 900 includes:
Obtain module 910, for obtaining training sample, the training sample include compound and with the compound phase The reaction information of pass, the reaction information include at least response rule;
Training module 920 is used for according to the training sample, using machine learning algorithm train classification models, until full Foot training termination condition;
Wherein, the disaggregated model is input with compound, is output with response rule.
Optionally, described device 900 further include:
Acquisition module, for acquiring chemical equation;
Abstraction module, for carrying out atomic series mark to the chemical equation, and based on the chemical reaction after mark Formula abstraction reaction center;
Processing module determines reaction rule according to processing result for being filtered to the reaction center and clustering processing Then.
Optionally, the compound in the training sample exists in the form of vectors, described device 900 further include:
First conversion module, for being converted the chemical formula of compound each in the chemical equation of acquisition using root algorithm is rubbed The minor structure sequence for being 1 for radius;
Second conversion module, for all minor structure sequences to be converted to the minor structure sequence pair using term vector model The vector answered;
Module is summed it up, for being directed to each compound, the corresponding vector of minor structure sequence of the compound is added Vector corresponding with the compound is obtained.
Optionally, the machine learning model includes gradient decline decision Tree algorithms, random forests algorithm, support vector machines Any one or more in algorithm and neural network algorithm.
Fig. 8 to Fig. 9 from the angle of function modoularization to the determining device of chemical synthesis route provided by the embodiments of the present application, The training device of disaggregated model is introduced, and the embodiment of the present application also provides corresponding equipment, below will be from hardware entities Angle above-mentioned apparatus provided by the embodiments of the present application is introduced.
Figure 10 is a kind of structural schematic diagram of equipment provided by the embodiments of the present application, which can be server, the clothes Business device 1000 can generate bigger difference because configuration or performance are different, may include one or more central processing units (central processing units, CPU) 1022 (for example, one or more processors) and memory 1032, one The storage medium 1030 of a or more than one storage application program 1042 or data 1044 (such as deposit by one or more magnanimity Store up equipment).Wherein, memory 1032 and storage medium 1030 can be of short duration storage or persistent storage.It is stored in storage medium 1030 program may include one or more modules (diagram does not mark), and each module may include in server Series of instructions operation.Further, central processing unit 1022 can be set to communicate with storage medium 1030, in server The series of instructions operation in storage medium 1030 is executed on 1000.
Server 1000 can also include one or more power supplys 1026, one or more wired or wireless nets Network interface 1050, one or more input/output interfaces 1058, and/or, one or more operating systems 1041, example Such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM etc..
The step as performed by server can be based on the server architecture shown in Fig. 10 in above-described embodiment.
Wherein, CPU 1022 is for executing following steps:
Chemical synthesis request is received, the chemical synthesis request is used for the chemical synthesis route of request target compound;
The target compound is input to disaggregated model, obtains response rule corresponding with the target compound;Institute Stating disaggregated model is with compound for input, and take response rule as the machine learning model of output;
Decomposition retrosynthetic analysis is carried out to the target compound according to the response rule to obtain synthesizing the targeted Raw material needed for closing object;
Chemical synthesis route corresponding with the target compound is determined according to the raw material.
Optionally, CPU 1022 can be also used for executing times of the determination method of chemical synthesis route in the embodiment of the present application Anticipate a kind of implementation the step of.
The embodiment of the present application also provides a kind of equipment for train classification models, have and equipment phase shown in Figure 10 Same structure, wherein CPU 1022 is for executing following steps:
Training sample is obtained, the training sample includes compound and reaction information relevant to the compound, institute Reaction information is stated including at least response rule;
According to the training sample, using machine learning algorithm train classification models, until meeting training termination condition;
Wherein, the disaggregated model is input with compound, is output with response rule.
Optionally, CPU 1022 can be also used for execute the embodiment of the present application in disaggregated model training method it is any one The step of kind implementation.
The embodiment of the present application also provides another equipment, as shown in figure 11, for ease of description, illustrate only and this Shen Please the relevant part of embodiment, it is disclosed by specific technical details, please refer to the embodiment of the present application method part.The terminal can be with Being includes mobile phone, tablet computer, personal digital assistant (full name in English: Personal Digital Assistant, English contracting Write: PDA), point-of-sale terminal (full name in English: Point of Sales, english abbreviation: POS), any terminal such as vehicle-mounted computer set It is standby, taking the terminal as an example:
Figure 11 shows the block diagram of the part-structure of mobile phone relevant to terminal provided by the embodiments of the present application.With reference to figure 11, mobile phone includes: radio frequency (full name in English: Radio Frequency, english abbreviation: RF) circuit 1110, memory 1120, defeated Enter unit 1130, display unit 1140, sensor 1150, voicefrequency circuit 1160, Wireless Fidelity (full name in English: wireless Fidelity, english abbreviation: WiFi) components such as module 1170, processor 1180 and power supply 1190.Those skilled in the art It is appreciated that handset structure shown in Figure 11 does not constitute the restriction to mobile phone, it may include more more or fewer than illustrating Component perhaps combines certain components or different component layouts.
Memory 1120 can be used for storing software program and module, and processor 1180 is stored in memory by operation 1120 software program and module, thereby executing the various function application and data processing of mobile phone.Memory 1120 can be led It to include storing program area and storage data area, wherein storing program area can be needed for storage program area, at least one function Application program (such as sound-playing function, image player function etc.) etc.;Storage data area, which can be stored, uses institute according to mobile phone Data (such as audio data, phone directory etc.) of creation etc..In addition, memory 1120 may include high random access storage Device, can also include nonvolatile memory, and a for example, at least disk memory, flush memory device or other volatibility are solid State memory device.
Processor 1180 is the control centre of mobile phone, using the various pieces of various interfaces and connection whole mobile phone, By running or execute the software program and/or module that are stored in memory 1120, and calls and be stored in memory 1120 Interior data execute the various functions and processing data of mobile phone, to carry out integral monitoring to mobile phone.Optionally, processor 1180 may include one or more processing units;Preferably, processor 1180 can integrate application processor and modulation /demodulation processing Device, wherein the main processing operation system of application processor, user interface and application program etc., modem processor is mainly located Reason wireless communication.It is understood that above-mentioned modem processor can not also be integrated into processor 1180.
In the embodiment of the present application, processor 1180 included by the terminal is also with the following functions:
Chemical synthesis request is received, the chemical synthesis request is used for the chemical synthesis route of request target compound;
The target compound is input to disaggregated model, obtains response rule corresponding with the target compound;Institute Stating disaggregated model is with compound for input, and take response rule as the machine learning model of output;
Decomposition retrosynthetic analysis is carried out to the target compound according to the response rule to obtain synthesizing the targeted Raw material needed for closing object;
Chemical synthesis route corresponding with the target compound is determined according to the raw material.
Optionally, the processor 1180 is also used to execute the training method of disaggregated model provided by the embodiments of the present application The step of any one implementation.
The embodiment of the present application also provides a kind of for the abnormal equipment for commenting on detection, the equipment application in service side, With structure identical with equipment shown in Figure 11, wherein processor 1180 is for executing following steps:
Training sample is obtained, the training sample includes compound and reaction information relevant to the compound, institute Reaction information is stated including at least response rule;
According to the training sample, using machine learning algorithm train classification models, until meeting training termination condition;
Wherein, the disaggregated model is input with compound, is output with response rule.
Optionally, processor 1180 can be also used for execute the embodiment of the present application in disaggregated model training method it is any A kind of the step of implementation.
The embodiment of the present application also provides a kind of computer readable storage medium, for storing program code, the program code For executing in a kind of determination method of chemical synthesis route described in foregoing individual embodiments or the training method of disaggregated model Any one embodiment.
The embodiment of the present application also provides a kind of computer program product including instruction, when run on a computer, So that computer executes a kind of determination method of chemical synthesis route described in foregoing individual embodiments or the training of disaggregated model Any one embodiment in method.
It is apparent to those skilled in the art that for convenience and simplicity of description, the system of foregoing description, The specific work process of device and unit, can refer to corresponding processes in the foregoing method embodiment, and details are not described herein.
In several embodiments provided herein, it should be understood that disclosed device and method can pass through it Its mode is realized.For example, the apparatus embodiments described above are merely exemplary, for example, the division of the unit, only Only a kind of logical function partition, there may be another division manner in actual implementation, such as multiple units or components can be tied Another system is closed or is desirably integrated into, or some features can be ignored or not executed.Another point, it is shown or discussed Mutual coupling, direct-coupling or communication connection can be through some interfaces, the INDIRECT COUPLING or logical of device or unit Letter connection can be electrical property, mechanical or other forms.
The unit as illustrated by the separation member may or may not be physically separated, aobvious as unit The component shown may or may not be physical unit, it can and it is in one place, or may be distributed over multiple In network unit.It can select some or all of unit therein according to the actual needs to realize the mesh of this embodiment scheme 's.
It should be appreciated that in this application, " at least one (item) " refers to one or more, and " multiple " refer to two or two More than a."and/or" indicates may exist three kinds of relationships, for example, " A and/or B " for describing the incidence relation of affiliated partner It can indicate: only exist A, only exist B and exist simultaneously tri- kinds of situations of A and B, wherein A, B can be odd number or plural number.Word Symbol "/" typicallys represent the relationship that forward-backward correlation object is a kind of "or"." at least one of following (a) " or its similar expression, refers to Any combination in these, any combination including individual event (a) or complex item (a).At least one of for example, in a, b or c (a) can indicate: a, b, c, " a and b ", " a and c ", " b and c ", or " a and b and c ", and wherein a, b, c can be individually, can also To be multiple.
The above, above embodiments are only to illustrate the technical solution of the application, rather than its limitations;Although referring to before Embodiment is stated the application is described in detail, those skilled in the art should understand that: it still can be to preceding Technical solution documented by each embodiment is stated to modify or equivalent replacement of some of the technical features;And these It modifies or replaces, the spirit and scope of each embodiment technical solution of the application that it does not separate the essence of the corresponding technical solution.

Claims (13)

1. a kind of determination method of chemical synthesis route, which is characterized in that the described method includes:
Chemical synthesis request is received, the chemical synthesis request is used for the chemical synthesis route of request target compound;
The target compound is input to disaggregated model, obtains response rule corresponding with the target compound;Described point Class model is with compound for input, and take response rule as the machine learning model of output;
Decomposition retrosynthetic analysis is carried out to the target compound according to the response rule to obtain synthesizing the target compound Required raw material;
Chemical synthesis route corresponding with the target compound is determined according to the raw material.
2. the method according to claim 1, wherein it is described according to the response rule to the target compound It carries out decomposing retrosynthetic analysis and obtains raw material needed for synthesizing the target compound and include:
Decomposition retrosynthetic analysis is carried out to the target compound according to the response rule and obtains decomposition product;
Whether search the decomposition product from raw database with the determination decomposition product is raw material;
Decomposition product is not raw material if it exists, then using the decomposition product as target compound, execution inputs the target compound To disaggregated model, response rule corresponding with the target compound is obtained, and according to the response rule to the targeted The step of object carries out decomposition retrosynthetic analysis is closed, until the decomposition product is raw material.
3. the method according to claim 1, wherein the method also includes:
It receives and the amendment that the chemical synthesis route is modified is requested;
It is requested in response to the amendment, corrects the chemical synthesis route.
4. according to claim 1 to method described in 3 any one, which is characterized in that the method also includes:
The target compound is converted into corresponding vector using term vector model;
It is then described the target compound is input to disaggregated model to include:
The corresponding vector of the target compound is input to disaggregated model.
5. according to claim 1 to method described in 3 any one, which is characterized in that the machine learning model includes gradient Decline any one or more in decision-tree model, Random Forest model, supporting vector machine model and neural network model.
6. a kind of training method of disaggregated model, which is characterized in that the described method includes:
Training sample is obtained, the training sample includes compound and reaction information relevant to the compound, described anti- Information is answered to include at least response rule;
According to the training sample, using machine learning algorithm train classification models, until meeting training termination condition;
Wherein, the disaggregated model is input with compound, is output with response rule.
7. according to the method described in claim 6, it is characterized in that, response rule obtains in the following way in the training sample It arrives:
Acquire chemical equation;
Atomic series mark is carried out to the chemical equation, and based on the chemical equation abstraction reaction center after mark;
The reaction center is filtered and clustering processing, response rule is determined according to processing result.
8. according to the method described in claim 6, it is characterized in that, the compound in the training sample is deposited in the form of vectors The corresponding vector of the compound obtains in the following way:
The chemical formula of compound each in the chemical equation of acquisition is converted into the minor structure sequence that radius is 1 using root algorithm is rubbed Column;
All minor structure sequences are converted into the corresponding vector of the minor structure sequence using term vector model;
For each compound, the corresponding vector of minor structure sequence of the compound is summed up to obtain the compound pair The vector answered.
9. according to method described in claim 6 to 8 any one, which is characterized in that the machine learning algorithm includes gradient Decline any one or more in decision Tree algorithms, random forests algorithm, algorithm of support vector machine and neural network algorithm.
10. a kind of determining device of chemical synthesis route, which is characterized in that described device includes:
Receiving module, for receiving chemical synthesis request, chemistry of the chemical synthesis request for request target compound is closed At route;
Module is obtained, for the target compound to be input to disaggregated model, is obtained corresponding with the target compound anti- Answer rule;Institute's disaggregated model is with compound for input, and take response rule as the machine learning model of output;
Decomposing module obtains synthesis institute for carrying out decomposition retrosynthetic analysis to the target compound according to the response rule State raw material needed for target compound;
Determining module, for determining chemical synthesis route corresponding with the target compound according to the raw material.
11. a kind of training device of disaggregated model, which is characterized in that described device includes:
Module is obtained, for obtaining training sample, the training sample includes compound and relevant to the compound anti- Information is answered, the reaction information includes at least response rule;
Training module, for using machine learning algorithm train classification models, being tied until meeting training according to the training sample Beam condition;
Wherein, the disaggregated model is input with compound, is output with response rule.
12. a kind of equipment, which is characterized in that the equipment includes processor and memory:
Said program code is transferred to the processor for storing program code by the memory;
The processor is used to be closed according to the chemistry described in any one of claim 1 to 5 of the instruction execution in said program code At the training method of the described in any item disaggregated models of determination method or claim 6 to 9 of route.
13. a kind of computer readable storage medium, which is characterized in that the computer readable storage medium is for storing program generation Code, said program code require the determination method or right of 1 to 5 described in any item chemical synthesis routes for perform claim It is required that the training method of 6 to 9 described in any item disaggregated models.
CN201910194016.3A 2019-03-14 2019-03-14 A kind of determination method and device of chemical synthesis route Pending CN109872780A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910194016.3A CN109872780A (en) 2019-03-14 2019-03-14 A kind of determination method and device of chemical synthesis route

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910194016.3A CN109872780A (en) 2019-03-14 2019-03-14 A kind of determination method and device of chemical synthesis route

Publications (1)

Publication Number Publication Date
CN109872780A true CN109872780A (en) 2019-06-11

Family

ID=66920313

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910194016.3A Pending CN109872780A (en) 2019-03-14 2019-03-14 A kind of determination method and device of chemical synthesis route

Country Status (1)

Country Link
CN (1) CN109872780A (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111524557A (en) * 2020-04-24 2020-08-11 腾讯科技(深圳)有限公司 Inverse synthesis prediction method, device, equipment and storage medium based on artificial intelligence
CN112037868A (en) * 2020-11-04 2020-12-04 腾讯科技(深圳)有限公司 Training method and device for neural network for determining molecular reverse synthetic route
WO2020254884A1 (en) * 2019-06-18 2020-12-24 International Business Machines Corporation Generating control settings for a chemical reactor
CN112652365A (en) * 2020-03-05 2021-04-13 腾讯科技(深圳)有限公司 Inverse synthesis processing method, device, electronic equipment and computer readable storage medium
CN113053474A (en) * 2021-04-13 2021-06-29 南京信息工程大学 Light chargeable material screening system and operation method thereof
CN113140260A (en) * 2020-01-20 2021-07-20 腾讯科技(深圳)有限公司 Method and device for predicting reactant molecular composition data of composition
EP3896699A1 (en) * 2020-03-06 2021-10-20 Accenture Global Solutions Limited Using machine learning for generating chemical product formulations
CN113782109A (en) * 2021-09-13 2021-12-10 烟台国工智能科技有限公司 Reactant derivation method and reverse synthesis derivation method based on Monte Carlo tree
WO2022130648A1 (en) * 2020-12-18 2022-06-23 富士通株式会社 Information processing program, information processing method, and information processing device
CN115206450A (en) * 2022-09-15 2022-10-18 药融云数字科技(成都)有限公司 Synthetic route recommendation method and terminal
CN115312135A (en) * 2022-07-21 2022-11-08 苏州沃时数字科技有限公司 Method, system, device and storage medium for predicting chemical reaction conditions
US11675334B2 (en) 2019-06-18 2023-06-13 International Business Machines Corporation Controlling a chemical reactor for the production of polymer compounds

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008106080A2 (en) * 2007-02-27 2008-09-04 University Of Pittsburgh - Of The Commonwealth System Of Higher Education Synthesis of desacetoxytubulysin h and analogs thereof
CN101789047A (en) * 2010-02-05 2010-07-28 四川大学 Method for evaluating synthesization of organic small-molecule compounds based on reverse synthesis

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008106080A2 (en) * 2007-02-27 2008-09-04 University Of Pittsburgh - Of The Commonwealth System Of Higher Education Synthesis of desacetoxytubulysin h and analogs thereof
CN101789047A (en) * 2010-02-05 2010-07-28 四川大学 Method for evaluating synthesization of organic small-molecule compounds based on reverse synthesis

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
JAMES LAW ET.AL: "Route Designer: A Retrosynthetic Analysis Tool Utilizing Automated Retrosynthetic Rule Generation", 《J. CHEM. INF. MODEL》 *
JAVIER L. BAYLON ET.AL: "Enhancing Retrosynthetic Reaction Prediction with Deep Learning Using Multiscale Reaction Classification", 《JOURNAL OF CHEMICAL INFORMATION AND MODELING》 *
MARWIN H. S. SEGLER ET.AL: "Neural-Symbolic Machine Learning for Retrosynthesis and Reaction Prediction", 《CHEMISTRY》 *

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2599837A (en) * 2019-06-18 2022-04-13 Ibm Generating control settings for a chemical reactor
US11675334B2 (en) 2019-06-18 2023-06-13 International Business Machines Corporation Controlling a chemical reactor for the production of polymer compounds
WO2020254884A1 (en) * 2019-06-18 2020-12-24 International Business Machines Corporation Generating control settings for a chemical reactor
GB2599837B (en) * 2019-06-18 2023-05-03 Ibm Generating control settings for a chemical reactor
US11520310B2 (en) 2019-06-18 2022-12-06 International Business Machines Corporation Generating control settings for a chemical reactor
CN113140260A (en) * 2020-01-20 2021-07-20 腾讯科技(深圳)有限公司 Method and device for predicting reactant molecular composition data of composition
CN113140260B (en) * 2020-01-20 2023-09-08 腾讯科技(深圳)有限公司 Method and device for predicting reactant molecular composition data of composition
WO2021174895A1 (en) * 2020-03-05 2021-09-10 腾讯科技(深圳)有限公司 Retrosynthesis processing method and apparatus, and electronic device and computer-readable storage medium
CN112652365A (en) * 2020-03-05 2021-04-13 腾讯科技(深圳)有限公司 Inverse synthesis processing method, device, electronic equipment and computer readable storage medium
EP4092682A4 (en) * 2020-03-05 2023-08-23 Tencent Technology (Shenzhen) Company Limited Retrosynthesis processing method and apparatus, and electronic device and computer-readable storage medium
EP3896699A1 (en) * 2020-03-06 2021-10-20 Accenture Global Solutions Limited Using machine learning for generating chemical product formulations
CN111524557B (en) * 2020-04-24 2024-04-05 腾讯科技(深圳)有限公司 Inverse synthesis prediction method, device, equipment and storage medium based on artificial intelligence
CN111524557A (en) * 2020-04-24 2020-08-11 腾讯科技(深圳)有限公司 Inverse synthesis prediction method, device, equipment and storage medium based on artificial intelligence
CN112037868A (en) * 2020-11-04 2020-12-04 腾讯科技(深圳)有限公司 Training method and device for neural network for determining molecular reverse synthetic route
WO2022095659A1 (en) * 2020-11-04 2022-05-12 腾讯科技(深圳)有限公司 Method and apparatus for training neural network for determining molecule retrosynthesis route
EP4213153A4 (en) * 2020-11-04 2024-03-20 Tencent Tech Shenzhen Co Ltd Method and apparatus for training neural network for determining molecule retrosynthesis route
CN112037868B (en) * 2020-11-04 2021-02-12 腾讯科技(深圳)有限公司 Training method and device for neural network for determining molecular reverse synthetic route
WO2022130648A1 (en) * 2020-12-18 2022-06-23 富士通株式会社 Information processing program, information processing method, and information processing device
CN113053474A (en) * 2021-04-13 2021-06-29 南京信息工程大学 Light chargeable material screening system and operation method thereof
CN113782109A (en) * 2021-09-13 2021-12-10 烟台国工智能科技有限公司 Reactant derivation method and reverse synthesis derivation method based on Monte Carlo tree
CN115312135A (en) * 2022-07-21 2022-11-08 苏州沃时数字科技有限公司 Method, system, device and storage medium for predicting chemical reaction conditions
CN115312135B (en) * 2022-07-21 2023-10-20 苏州沃时数字科技有限公司 Chemical reaction condition prediction method, system, device and storage medium
CN115206450B (en) * 2022-09-15 2022-12-06 药融云数字科技(成都)有限公司 Synthetic route recommendation method and terminal
CN115206450A (en) * 2022-09-15 2022-10-18 药融云数字科技(成都)有限公司 Synthetic route recommendation method and terminal

Similar Documents

Publication Publication Date Title
CN109872780A (en) A kind of determination method and device of chemical synthesis route
US10622098B2 (en) Systems and methods for predicting chemical reactions
Aziz et al. Dimension reduction methods for microarray data: a review
CN107862173B (en) Virtual screening method and device for lead compound
Nyathi et al. Comparison of a genetic algorithm to grammatical evolution for automated design of genetic programming classification algorithms
Nemati et al. A novel ACO–GA hybrid algorithm for feature selection in protein function prediction
Bebek et al. PathFinder: mining signal transduction pathway segments from protein-protein interaction networks
Liu et al. Evolving feature selection
He et al. A generalization of vit/mlp-mixer to graphs
CN108108743A (en) Abnormal user recognition methods and the device for identifying abnormal user
CN109727637B (en) Method for identifying key proteins based on mixed frog-leaping algorithm
JP2022505540A (en) Systems and methods for active transfer learning using deep feature extraction
CA2942106A1 (en) Aligning and clustering sequence patterns to reveal classificatory functionality of sequences
CN111724866A (en) Automated prediction of biological response of compounds based on chemical information
Al-Maqaleh et al. A genetic algorithm for discovering classification rules in data mining
Wang et al. Molecular property prediction based on a multichannel substructure graph
Ye et al. Adaptive unsupervised feature learning for gene signature identification in non-small-cell lung cancer
Flores-Garrido et al. Mining maximal frequent patterns in a single graph using inexact matching
Azevedo et al. Hybrid approaches to optimization and machine learning methods: a systematic literature review
Garimella et al. Churn prediction using optimized deep learning classifier on huge telecom data
CN112115996B (en) Image data processing method, device, equipment and storage medium
Hansen et al. A VNS heuristic for escaping local extrema entrapment in normalized cut clustering
Nelson et al. Higher order interactions: detection of epistasis using machine learning and evolutionary computation
Ye et al. High‐Dimensional Feature Selection Based on Improved Binary Ant Colony Optimization Combined with Hybrid Rice Optimization Algorithm
Choong et al. Jack and Masters of all Trades: One-Pass Learning Sets of Model Sets From Large Pre-Trained Models

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190611

RJ01 Rejection of invention patent application after publication