WO2022042506A1 - Convolutional neural network-based cell screening method and device - Google Patents

Convolutional neural network-based cell screening method and device Download PDF

Info

Publication number
WO2022042506A1
WO2022042506A1 PCT/CN2021/114165 CN2021114165W WO2022042506A1 WO 2022042506 A1 WO2022042506 A1 WO 2022042506A1 CN 2021114165 W CN2021114165 W CN 2021114165W WO 2022042506 A1 WO2022042506 A1 WO 2022042506A1
Authority
WO
WIPO (PCT)
Prior art keywords
cells
cell
training
convolutional neural
neural network
Prior art date
Application number
PCT/CN2021/114165
Other languages
French (fr)
Chinese (zh)
Inventor
陈亮
韩晓健
侯媛媛
哈斯木买买提依明
梁国龙
Original Assignee
深圳太力生物技术有限责任公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳太力生物技术有限责任公司 filed Critical 深圳太力生物技术有限责任公司
Publication of WO2022042506A1 publication Critical patent/WO2022042506A1/en

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • G16B25/10Gene or protein expression profiling; Expression-ratio estimation or normalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B35/00ICT specially adapted for in silico combinatorial libraries of nucleic acids, proteins or peptides
    • G16B35/10Design of libraries
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding

Definitions

  • the present application relates to the field of biotechnology, in particular to a cell screening method, device, computer equipment and storage medium based on convolutional neural network.
  • the cells in the cell pool can be transfected first, and the cell pool can be processed by a limiting dilution method to obtain a single cell, and then a single cell can be obtained.
  • Cells are cultured with homogeneous cell populations, namely cell lines, and the cell lines with high target protein expression are screened.
  • a cell screening method based on a convolutional neural network comprising:
  • a plurality of grayscale images of cells to be tested corresponding to a plurality of cells to be tested are input into the target convolutional neural network model;
  • the target convolutional neural network model is obtained by training a plurality of grayscale images of training cells with expression labels, and the said target convolutional neural network model is obtained by training
  • the expression label is used to characterize the real protein expression of the cells in the grayscale image of each training cell, and the target convolutional neural network model is used to detect the protein expression of the cells in the grayscale image of the cells to be tested of the input model;
  • target cells whose protein expression levels meet the set conditions are determined from the plurality of cells to be tested.
  • the actual protein expression level of the cells in the corresponding training cell grayscale map is determined, and the expression level label corresponding to the training cell grayscale map is obtained based on the actual protein expression level;
  • the convolutional neural network model is trained by using the training cell grayscale image and the expression label to generate a target convolutional neural network model.
  • the training cell fluorescence map determine the actual protein expression level of the cells in the corresponding training cell grayscale map, and obtain the expression level label corresponding to the training cell grayscale map based on the actual protein expression level.
  • the real protein expression level is determined as the expression level label corresponding to the grayscale image of the training cells.
  • the training of a convolutional neural network model using the training cell grayscale image and the expression label to generate a target convolutional neural network model including:
  • the network parameters of the convolutional neural network model are adjusted by reducing the error to obtain optimal network parameters, and the target convolutional neural network model is generated by using the optimal network parameters.
  • the convolutional neural network model includes a multi-layer structure, and according to the training error, the network parameters of the convolutional neural network model are adjusted by reducing the error to obtain optimal network parameters, including: :
  • the convolutional neural network model includes a first network structure, a second network structure, a third network structure, a fourth network structure and a fully connected layer;
  • the training cell grayscale map is a plurality of training cell grayscales. picture;
  • For each training cell grayscale image obtain the corresponding training cell feature through the first network structure; input the training cell feature into the second network structure, the third network structure and the fourth network structure to obtain the corresponding a first cell feature, a second cell feature, and a third cell feature;
  • the first cell feature, the second cell feature and the third cell feature are connected in parallel to obtain the feature fusion result corresponding to each training cell grayscale image; wherein the first cell feature, the second cell feature and the third cell feature are Three-cell features have different levels of abstract expression;
  • the feature fusion results corresponding to the grayscale images of the training cells are input to the fully connected layer, and the protein expression levels corresponding to the grayscale images of the training cells are determined according to the output results of the fully connected layer.
  • the target cells whose protein expression levels meet the set conditions are determined from the plurality of cells to be tested, including:
  • the grayscale image of the cell to be tested corresponding to the target expression level is determined, and the cell to be tested corresponding to the grayscale image of the cell to be tested is determined as the target cell.
  • a cell screening device based on a convolutional neural network comprising:
  • the grayscale image acquisition module of the cells to be tested is used to acquire the grayscale images of the cells to be tested corresponding to a plurality of cells to be tested in the cell culture tank;
  • the first input module is used to input multiple grayscale images of cells to be tested corresponding to multiple cells to be tested into the target convolutional neural network model; the target convolutional neural network model The degree map training is obtained, the expression label is used to represent the real protein expression of the cells in the grayscale map of each training cell, and the target convolutional neural network model is used to detect the input model. protein expression;
  • a protein expression level prediction module configured to obtain the respective protein expression levels corresponding to the plurality of cells to be tested according to the output of the target convolutional neural network model
  • the cell screening module is used to determine, from the plurality of cells to be tested, the target cells whose protein expression meets the set condition according to the protein expression levels corresponding to the plurality of cells to be tested.
  • a computer device comprising a memory and a processor, wherein the memory stores a computer program, and when the processor executes the computer program, the steps of the above-mentioned cell screening method based on a convolutional neural network are implemented:
  • the above-mentioned cell screening method, device, computer equipment and storage medium based on convolutional neural network by acquiring the grayscale images of the cells to be tested corresponding to the plurality of cells to be tested in the cell culture tank, and the corresponding cells of the plurality of cells to be tested.
  • Multiple grayscale images of the cells to be tested are input into the target convolutional neural network model, and according to the output of the target convolutional neural network model, the protein expression levels corresponding to the plurality of cells to be tested are obtained, and then the protein expression levels corresponding to the plurality of cells to be tested are obtained.
  • the target cells whose protein expression levels meet the set conditions are determined from multiple cells to be tested, which realizes the rapid determination of cells with high protein expression levels, and avoids the need to undergo repeated culture and screening before cell screening can be performed.
  • the application can quickly process millions of single cells, while increasing the range of cell screening, reducing the workload of staff and effectively improving the efficiency of cell screening.
  • FIG. 1 is a schematic flowchart of a cell screening method based on a convolutional neural network in one embodiment
  • Fig. 2 is a schematic flowchart of steps of model training in one embodiment
  • Figure 3a is a grayscale image of a training cell in one embodiment
  • Figure 3b is a fluorescence image of a training cell in one embodiment
  • FIG. 5 is a schematic flowchart of a step of model parameter adjustment in one embodiment
  • FIG. 6 is a schematic flowchart of a step of predicting protein expression in one embodiment
  • FIG. 7 is a structural block diagram of a cell screening device based on a convolutional neural network in one embodiment
  • FIG. 8 is a diagram of the internal structure of a computer device in one embodiment.
  • the cells in the cell pool can be transfected first, and the cell pool can be processed by a limiting dilution method to obtain a single cell, and then a single cell can be obtained.
  • Cells are cultured with homogeneous cell populations, namely cell lines, and the cell lines with high target protein expression are screened.
  • a cell screening method based on a convolutional neural network is provided.
  • the method is applied to a terminal for illustration. It can be understood that the method can also be applied to
  • the server can also be applied to a system including a terminal and a server, and the method is implemented through the interaction between the terminal and the server.
  • the method includes the following steps:
  • Step 101 Obtain grayscale images of cells to be tested corresponding to a plurality of cells to be tested in the cell culture tank.
  • the cells to be tested may be cells treated with transfection technology
  • the cells to be tested may be cells that fail to obtain exogenous DNA fragments after treatment with transfection technology, or cells that have obtained exogenous DNA fragments but not A cell that has been integrated into a chromosome, or a cell in which a foreign DNA segment has been integrated into a chromosome.
  • the grayscale image of the cells to be tested is a grayscale image of the cells to be tested.
  • multiple cells in the cell culture pool can be transfected, so that some or all of the cells in the cell culture pool can obtain exogenous DNA fragments.
  • grayscale images of the cells to be tested corresponding to the plurality of cells to be tested in the cell culture tank can be obtained.
  • Step 102 Inputting multiple grayscale images of cells to be tested corresponding to multiple cells to be tested into a target convolutional neural network model; the target convolutional neural network model is obtained by training a plurality of grayscale images of training cells with expression labels. , the expression label is used to represent the real protein expression of the cells in the grayscale image of each training cell, and the target convolutional neural network model is used to detect the cells in the grayscale image of the cells to be tested that are input to the target convolutional neural network model. protein expression.
  • the grayscale image of the training cells is a picture used as a training sample, which is used to train a convolutional neural network model, and the cells in the grayscale image of the training cells are cells that have been transfected.
  • the convolutional neural network model can be trained by using multiple grayscale images of training cells with expression labels to obtain the target convolutional neural network model.
  • the expression label corresponding to the grayscale image of the training cell can represent the real protein expression of the cell in the grayscale image of the training cell. Since the main expression product of the gene is protein, the grayscale image of the training cell can be determined through the actual protein expression. expression of target genes in cells.
  • the grayscale images of the cells to be tested can be input into the trained target convolutional neural network model, and the target convolutional neural network model can be used to detect The protein expression of the cells in the grayscale image of the cells to be tested.
  • Step 103 according to the output of the target convolutional neural network model, obtain the respective protein expression levels corresponding to the plurality of cells to be tested.
  • the protein expression level corresponding to the cells to be tested is the protein expression level of the cells in the grayscale image of the cells to be tested, and the protein expression level is predicted by the target convolutional neural network model.
  • the protein expression levels corresponding to the cells to be tested can be obtained according to the output of the target convolutional neural network model.
  • Step 104 according to the protein expression levels corresponding to the plurality of cells to be tested, determine, from the plurality of cells to be tested, target cells whose protein expression meets the set condition.
  • the plurality of cells to be tested can be screened, and the target cells whose protein expression levels meet the set conditions are determined from the plurality of cells to be tested.
  • the grayscale images of the cells to be tested corresponding to the cells to be tested are input into the target convolutional neural network model , and according to the output of the target convolutional neural network model, the protein expression levels corresponding to the multiple cells to be tested are obtained, and then the protein expression levels are determined from the multiple cells to be tested according to the protein expression levels corresponding to the multiple cells to be tested.
  • Target cells that meet the set conditions can quickly determine cells with high protein expression, avoid the need for repeated culture and screening before cell screening, greatly shorten the screening cycle, and quickly process millions of cells through this application
  • the single cell can increase the range of cell screening, reduce the workload of the staff, and effectively improve the efficiency of cell screening.
  • the cell screening method based on convolutional neural network may further include the following steps:
  • Step 201 acquiring a training cell grayscale image and its corresponding training cell fluorescence image.
  • a cell as a training set can be set, and the cell can be photographed to obtain a grayscale image of the training cell and a corresponding fluorescence image of the training cell, respectively.
  • the cells in the training set can be used to train the convolutional neural network model, and the grayscale image of the training cells and the corresponding fluorescence image of the training cells can be the grayscale images obtained by shooting the same cells under the same shooting conditions and Fluorescence map.
  • the cells in the training set can be cells in the cell culture pool that have been processed by transfection technology.
  • the cells can be cells that have not been able to obtain exogenous DNA fragments after the transfection technology treatment, or cells that have obtained exogenous DNA fragments. But cells that are not integrated into chromosomes, or cells that have foreign DNA fragments integrated into chromosomes.
  • the grayscale image and the fluorescence image can be captured simultaneously with a microscope under the same shooting conditions, and the obtained grayscale image and fluorescence image can include one or more cells , the coordinates of each cell in the grayscale image correspond to the coordinates of that cell in the fluorescence image. Since the same grayscale image and fluorescence image can contain multiple cells at the same time, after obtaining the grayscale image and the fluorescence image, the grayscale image and the fluorescence image can be segmented to obtain the training cell grayscale image and training cell corresponding to a single cell.
  • the fluorescence images are shown in Fig. 3a and Fig. 3b.
  • Step 202 according to the training cell fluorescence map, determine the actual protein expression level of the cells in the corresponding training cell grayscale map, and obtain an expression level label corresponding to the training cell grayscale map based on the actual protein expression level.
  • the actual protein expression level of the cells in the corresponding training cell grayscale map can be determined according to the training cell fluorescence map, and the expression level label corresponding to the training cell grayscale map can be obtained based on the real protein expression level.
  • Step 203 using the training cell grayscale image and the expression label to train a convolutional neural network model to generate a target convolutional neural network model.
  • the training cell grayscale image and the corresponding expression label can be used to train the convolutional neural network model to generate the target convolutional neural network model.
  • the cell grayscale map and the cell protein expression level in the cell grayscale map can be established.
  • the relationship between them provides model support for the rapid screening of cells with high protein expression.
  • the actual protein expression of cells in the corresponding training cell grayscale image is determined according to the training cell fluorescence image, and the expression corresponding to the training cell grayscale image is obtained based on the actual protein expression.
  • quantity label which can include the following steps:
  • proteins produced by genes of interest can fluoresce at specific wavelengths.
  • the value corresponding to the green channel in the training cell fluorescence map (also called the fluorescence value) can be determined, and the real protein expression of the cells in the corresponding training cell grayscale image can be determined according to the value, and then The real protein expression can be determined as the expression label corresponding to the grayscale image of the training cells.
  • the relationship between the fluorescence value and the protein expression amount can be a positive correlation, and by obtaining the quantitative mapping relationship between the fluorescence value and the protein expression amount, the real protein expression amount can be determined by the green brightness value.
  • the actual protein expression level of the cells in the corresponding training cell grayscale image is determined, and the actual protein expression level is determined as the corresponding training cell grayscale image.
  • the expression label can use the value corresponding to the green channel of the training cell fluorescence image as an intermediate variable to quantify the protein expression of the cells in the grayscale image of the training cell, and obtain the expression label corresponding to the grayscale image of the training cell, providing accurate real protein Expression data.
  • the training of a convolutional neural network model by using the training cell grayscale image and the expression label to generate a target convolutional neural network model may include the following steps:
  • Step 401 Input the grayscale image of the training cells into the convolutional neural network model, and determine the protein expression level corresponding to the grayscale image of the training cells.
  • the training cell grayscale image can be input into the convolutional neural network model, and the output result of the convolutional neural network model can be used to determine the corresponding grayscale image of the training cell.
  • the protein expression level where the convolutional neural network model is used to predict the protein expression level of the cells in the grayscale image of the training cells.
  • the convolutional neural network model takes the grayscale image of the training cells as input. Predict the corresponding protein expression level.
  • Step 402 Determine the training error according to the protein expression level corresponding to the grayscale image of the training cells and the expression level label.
  • the predicted protein expression level is the same as the protein expression level predicted by the convolutional neural network model. There is a gap between the real protein expression levels. Based on this, the training error between the two can be determined by training the protein expression level and expression level label corresponding to the grayscale image of the training cells. In practical applications, the cost function can be used to calculate the training error between the protein expression level corresponding to the grayscale image of the training cell and the expression level label.
  • Step 403 according to the training error, adjust the network parameters of the convolutional neural network model by reducing the error to obtain optimal network parameters, and use the optimal network parameters to generate a target convolutional neural network model.
  • the network parameters of the convolutional neural network model can be adjusted according to the training error and the adjustment purpose of reducing the error until the optimal network parameters are obtained.
  • the target convolutional neural network model can be generated based on the optimal network parameters.
  • the training error is determined by the protein expression level and the expression level label corresponding to the grayscale image of the training cells, and the network parameters of the convolutional neural network model are adjusted according to the training error to obtain the optimal network parameters, and the optimal network parameters are adopted.
  • the optimal network parameters generate the target convolutional neural network model, which can continuously train and optimize the convolutional neural network model through the gap between the predicted protein expression and the actual protein expression.
  • the convolutional neural network model includes a multi-layer structure, and according to the training error, the network parameters of the convolutional neural network model are adjusted by reducing the error to obtain optimal network parameters , which can include the following steps:
  • the last layer of backpropagation adjusts the network parameters of each layer of the convolutional neural network model by reducing the error, and returns to input the grayscale image of the training cells into the convolutional neural network model to determine the training Steps of protein expression levels corresponding to cell grayscale images.
  • convolutional neural network models can include multi-layer structures, such as setting max pooling layers, average pooling layers, and multi-layer convolutional layers.
  • it is determined whether the training error converges and is smaller than the preset error threshold.
  • the network parameters of the current convolutional neural network model can be determined as the optimal network parameters;
  • the last layer of the network model is back-propagated, and the network parameters of each layer in the convolutional neural network model are adjusted according to the adjustment direction of reducing the error.
  • the grayscale image of the training cells can be returned to the convolutional neural network model. , the steps of determining the protein expression level corresponding to the grayscale image of the training cells.
  • the training cell grayscale image and the training fluorescence image can be obtained.
  • the training cell grayscale image can be input into the convolutional neural network model.
  • the output value (that is, the protein expression corresponding to the grayscale image of the training cells in this application) is obtained through propagation, and the training error between the output value and the real value (that is, the expression label in this application) is calculated.
  • the training error After the training error is obtained, it can be judged whether the training error is converged and small enough. If so, the training error can be back-propagated, and the SGD (Stochastic Gradient Descent) algorithm or other optimization algorithm can be used to update the connection weights and biases of each layer (ie network parameters in this application), and pass forward propagation again to obtain the output value; if not, the current network parameters can be determined as the optimal network parameters, and the target convolutional neural network model is generated based on the optimal network parameters.
  • SGD Spochastic Gradient Descent
  • the transfected cells in the cell culture pool can be divided into a training set, a validation set and a test set, in which the grayscale images of the cells in the training set corresponding to the training cells can be used to train the convolutional neural network model; the validation set The grayscale image of the cells corresponding to the cells can be used to verify the trained target convolutional neural network model to prevent the model from overfitting on the training set, and the accuracy of the model during the training process can be determined by the validation set; , which can be the cells to be tested in the present application.
  • the target convolutional neural network model the target cells whose protein expression meets the set conditions can be determined from a plurality of cells to be tested.
  • the training error can be back-propagated from the last layer of the convolutional neural network model, and the network parameters of each layer of the convolutional neural network model can be adjusted by reducing the error.
  • the network parameters can be continuously optimized through iterative calculation until the predicted protein expression is close to the real protein expression, which improves the prediction accuracy of the target convolutional neural network model for the protein expression.
  • the convolutional neural network model includes a first network structure, a second network structure, a third network structure, a fourth network structure and a fully connected layer
  • the training cell grayscale image may be a plurality of training cell grayscale images .
  • inputting the training cell grayscale image into the convolutional neural network model, and determining the protein expression level corresponding to the training cell grayscale image may include the following steps:
  • Step 601 Inputting multiple grayscale images of training cells into the convolutional neural network model.
  • multiple grayscale images of training cells can be input into the convolutional neural network model.
  • Step 602 for each training cell grayscale image, obtain the corresponding training cell feature through the first network structure; input the training cell feature into the second network structure, the third network structure and the fourth network structure, The corresponding first cell feature, second cell feature and third cell feature are obtained.
  • the training cell features corresponding to the training cell grayscale map can be obtained through the first network structure, and the training cell features can be input into the second network structure, the third network structure, and the fourth network structure to obtain Corresponding first cell signature, second cell signature and third cell signature.
  • Step 603 Connect the first cell feature, the second cell feature and the third cell feature in parallel to obtain a feature fusion result corresponding to each training cell grayscale image; wherein, the first cell feature, the second cell feature Features and tertiary cell features have different levels of abstract expression.
  • the first cell feature, the second cell feature and the third cell feature can be connected in parallel to obtain the feature fusion result corresponding to each training cell grayscale image , wherein the first cell feature, the second cell feature, and the third cell feature can have different levels of abstract expression.
  • the first network structure may be a feature extraction network composed of 10 convolutional layers
  • the second network structure may be composed of 11 convolutional layers and an average pooling layer
  • the third network structure may be a 2-layer convolutional layer.
  • the accumulation layer and the maximum pooling layer are composed together
  • the fourth network structure can be a network composed of an average pooling layer added after the third network structure, that is, the fourth network structure can be composed of 2 layers of convolutional layers, maximum pooling layers and average pooling layers. Composition of pooling layers.
  • the shallow network can extract simple features in the grayscale image of the training cells, such as feature extraction for cell shape, color, texture and cell edge, which can reflect the specific features of a certain dimension of the cell , and the features extracted by the deep network can abstract the features extracted by the shallow network to obtain cell features that can reflect the overall cell. Based on this, after the specific training cell features are extracted from the first network structure, the training cell features can be further input into the second network structure, the third network structure and the fourth network structure, and through different levels of networks, different abstract expressions can be obtained. Hierarchical cellular characteristics.
  • the first cell feature, the second cell feature, and the third cell feature can be output in the form of matrices.
  • each matrix can be After multiplying by different weights, add and sum up, the result is the feature fusion result, in which the weight of the matrix is positively correlated with the proportion of cell features extracted by the network structure, that is, the larger the weight, the more cell features extracted by the network structure. higher proportion.
  • Step 604 Input the feature fusion results corresponding to the grayscale images of the training cells to the fully connected layer, and determine the protein expression levels corresponding to the grayscale images of the training cells according to the output results of the fully connected layer.
  • the feature fusion results corresponding to the grayscale images of the training cells can be input to the fully connected layer, and the protein expression corresponding to the grayscale images of the training cells can be determined according to the output results of the fully connected layer. quantity.
  • the input parameters can be defined as B*3*448*448, where B is the number of grayscale images of training cells input to the network each time the convolutional neural network model is trained, and 3 means the number of image channels is R , G, B three channels, 448 is the width and height of the picture.
  • each training cell feature can be input to the second network structure and the third network structure respectively.
  • the fourth network structure, the second network structure, the third network structure and the fourth network structure can respectively output a matrix of size B*100, that is, the first cell feature, the second cell feature and the third cell feature, where the matrix
  • the size can be adjusted during training, that is, the value of 100 can be adjusted according to actual needs.
  • each matrix can be multiplied by different weights and added and summed to obtain a matrix of B*100, that is, the result of feature fusion, where the weights can be 0
  • the number of changes in the interval to 1.
  • the result can be input to the fully connected layer.
  • the number of inputs of the fully connected layer corresponds to the size of the matrix, and the number of outputs is 1.
  • the result of feature fusion can pass through an input of 100 and output
  • the fully-connected layer with 1 obtains a vector whose output is in the form of B*1.
  • Each component in the vector corresponds to a grayscale image of the training cells, and the value of the component is the expression level of the second protein.
  • determining from the plurality of cells to be tested the target cells whose protein expression meets the set condition may include the following steps:
  • the grayscale map of the cells to be tested corresponding to the target expression level is determined, and the cells to be tested corresponding to the grayscale map of the cells to be tested are determined as target cells.
  • the protein expression levels corresponding to the plurality of cells to be tested can be sorted, and from the sorted protein expression levels, the most advanced prediction The amount of protein expression was determined as the target expression level.
  • the protein expression levels corresponding to the plurality of cells to be tested can be sorted in descending order, that is, sorted from large to small. After sorting, the protein expression levels corresponding to the top N names can be determined as the target expression levels. Of course, in practical applications, the protein expression level exceeding the preset expression level threshold can also be determined as the target expression level.
  • the grayscale image of the cell to be tested corresponding to the target expression level can be determined, and the cell to be tested corresponding to the grayscale image of the cell to be tested is determined as the target cell.
  • the target cells can be used to culture cell lines.
  • the protein expression levels corresponding to the plurality of cells to be tested are sorted, and according to the sorted protein expression levels corresponding to the plurality of cells to be tested, the protein expression levels of the plurality of cells to be tested are ranked first.
  • the preset number of cells are determined as target cells, which can quickly screen cells with high protein expression, which greatly reduces the screening workload.
  • the acquiring a grayscale image of the training cells may include the following steps:
  • the original cell grayscale image used for model training and perform normalization processing on the original cell grayscale image; perform data enhancement processing on the processed original cell grayscale image to obtain the training cell grayscale image; the Data enhancement processing includes any one or more of the following: rotation processing, inversion processing, contrast enhancement processing, and random cropping processing.
  • the original cell grayscale image used for model training can be obtained, and the original cell grayscale image can be normalized, wherein the original cell grayscale image can be photographed using a microscope on the cells used as the training set Grayscale image.
  • data enhancement processing can be performed on the processed raw grayscale image, such as rotating, flipping, randomly cropping the image, or enhancing the contrast of the image.
  • the training cell grayscale image is obtained, and the training cell grayscale image used for training the convolutional neural network model can be added.
  • the training samples are rapidly expanded to provide data support for the training of the convolutional neural network model.
  • FIGS. 1, 2, and 4-6 are displayed in sequence according to the arrows, these steps are not necessarily executed in the sequence indicated by the arrows.
  • the steps may be performed in other orders unless explicitly stated herein to indicate a strict order restriction on the performance of the steps.
  • at least a part of the steps in FIGS. 1, 2, and 4-6 may include multiple steps or multiple stages. These steps or stages are not necessarily executed at the same time, but may be executed at different times.
  • the order of execution of the stages is not necessarily sequential, but may be performed alternately or alternately with other steps or at least a portion of the steps or stages in the other steps.
  • a cell screening device based on a convolutional neural network which may include:
  • the grayscale image acquisition module 701 of the cells to be tested is used to obtain grayscale images of the cells to be tested corresponding to a plurality of cells to be tested in the cell culture tank;
  • the first input module 702 is used to input multiple grayscale images of the cells to be tested corresponding to the multiple cells to be tested into the target convolutional neural network model; the target convolutional neural network model is based on multiple training cells with expression labels.
  • the grayscale image training is obtained, the expression label is used to represent the real protein expression of the cells in the grayscale image of each training cell, and the target convolutional neural network model is used to detect the cells to be tested that are input to the target convolutional neural network model.
  • the protein expression of cells in the grayscale image is used to input multiple grayscale images of the cells to be tested corresponding to the multiple cells to be tested into the target convolutional neural network model;
  • a protein expression level prediction module 703, configured to obtain the respective protein expression levels corresponding to the multiple cells to be tested according to the output of the target convolutional neural network model;
  • the cell screening module 704 is configured to determine, according to the protein expression levels corresponding to the plurality of cells to be tested, target cells whose protein expression meets the set condition from the plurality of cells to be tested.
  • the cell screening device based on convolutional neural network may further include:
  • the training cell grayscale image acquisition module is used to obtain the training cell grayscale image and its corresponding training cell fluorescence image
  • the expression label determination module is used to determine the real protein expression of the cells in the corresponding training cell grayscale image according to the training cell fluorescence map, and obtain the expression corresponding to the training cell grayscale image based on the real protein expression quantity label;
  • a training module is used to train a convolutional neural network model by using the training cell grayscale map and the expression label to generate a target convolutional neural network model.
  • the expression quantity label determination module includes:
  • the green brightness value determination submodule is used to determine the value of the green channel in the fluorescence image of the training cells
  • the real protein expression determination submodule is used to determine the real protein expression of the cells in the corresponding training cell grayscale image according to the value of the green channel in the training cell fluorescence image;
  • the expression level label generation sub-module is used for determining the real protein expression level as the expression level label corresponding to the grayscale image of the training cells.
  • the training module includes:
  • a protein expression level determination submodule configured to input the grayscale image of the training cells into the convolutional neural network model, and determine the protein expression level corresponding to the grayscale image of the training cells;
  • a training error determination submodule used for determining the training error according to the protein expression corresponding to the grayscale image of the training cells and the expression label;
  • a parameter adjustment sub-module configured to adjust the network parameters of the convolutional neural network model by reducing the error according to the training error to obtain optimal network parameters, and use the optimal network parameters to generate a target convolution Neural network model.
  • the convolutional neural network model includes a multi-layer structure
  • the parameter adjustment sub-module includes:
  • a judgment unit for judging whether the training error has converged and is less than a preset error threshold; if so, call the parameter determination unit; if not, call the backpropagation unit;
  • a parameter determination unit used for determining the network parameters of the current convolutional neural network model as the optimal network parameters
  • a back-propagation unit configured to use the training error to back-propagate from the last layer of the convolutional neural network model, adjust the network parameters of each layer of the convolutional neural network model by reducing the error, and return The step of inputting the training cell grayscale image into the convolutional neural network model, and determining the protein expression level corresponding to the training cell grayscale image.
  • the convolutional neural network model includes a first network structure, a second network structure, a third network structure, a fourth network structure and a fully connected layer;
  • the training cell grayscale image is a plurality of training cells grayscale image;
  • the protein expression level determination submodule includes:
  • the second input unit is used for inputting a plurality of training cell grayscale images into the convolutional neural network model
  • the training cell feature acquisition unit is used to obtain the corresponding training cell feature through the first network structure for each training cell grayscale image; input the training cell feature into the second network structure, the third network structure and The fourth network structure obtains the corresponding first cell feature, second cell feature and third cell feature;
  • a feature fusion result acquisition unit configured to connect the first cell feature, the second cell feature and the third cell feature in parallel to obtain a feature fusion result corresponding to each training cell grayscale image; wherein, the first cell feature The feature, the second cell feature, and the second cell feature have different levels of abstract expression;
  • the result output unit is used to input the feature fusion results corresponding to the grayscale images of the training cells to the fully connected layer, and determine the second corresponding grayscale images of the training cells according to the output results of the fully connected layer. protein expression.
  • the cell screening module 704 includes:
  • the sorting submodule is used to sort the protein expression levels corresponding to the multiple cells to be tested, and from the protein expression levels corresponding to the sorted multiple cells to be tested, determine the protein expression level of the first preset number as target expression level;
  • the target cell determination submodule is used to determine the grayscale image of the cell to be tested corresponding to the target expression level, and to determine the cell to be tested corresponding to the grayscale image of the cell to be tested as the target cell.
  • the training cell grayscale image acquisition module includes:
  • the original cell grayscale image acquisition sub-module is used to obtain the original cell grayscale image used for model training, and normalize the original cell grayscale image;
  • the data enhancement processing submodule is used to perform data enhancement processing on the processed original cell grayscale image to obtain the training cell grayscale image; the data enhancement processing includes any one or more of the following: rotation processing, flip processing, contrast Enhanced processing, random cropping processing.
  • Each module in the above-mentioned cell screening device based on convolutional neural network can be implemented in whole or in part by software, hardware and combinations thereof.
  • the above modules can be embedded in or independent of the processor in the computer device in the form of hardware, or stored in the memory in the computer device in the form of software, so that the processor can call and execute the operations corresponding to the above modules.
  • a computer device is provided, and the computer device may be a terminal, and its internal structure diagram may be as shown in FIG. 8 .
  • the computer equipment includes a processor, memory, a communication interface, a display screen, and an input device connected by a system bus.
  • the processor of the computer device is used to provide computing and control capabilities.
  • the memory of the computer device includes a non-volatile storage medium, an internal memory.
  • the nonvolatile storage medium stores an operating system and a computer program.
  • the internal memory provides an environment for the execution of the operating system and computer programs in the non-volatile storage medium.
  • the communication interface of the computer device is used for wired or wireless communication with an external terminal, and the wireless communication can be realized by WIFI, operator network, NFC (Near Field Communication) or other technologies.
  • the computer program when executed by a processor, implements a convolutional neural network based cell screening method.
  • the display screen of the computer equipment may be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer equipment may be a touch layer covered on the display screen, or a button, a trackball or a touchpad set on the shell of the computer equipment , or an external keyboard, trackpad, or mouse.
  • FIG. 8 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer equipment to which the solution of the present application is applied. Include more or fewer components than shown in the figures, or combine certain components, or have a different arrangement of components.
  • a computer device including a memory and a processor, a computer program is stored in the memory, and the processor implements the following steps when executing the computer program:
  • a plurality of grayscale images of cells to be tested corresponding to a plurality of cells to be tested are input into the target convolutional neural network model;
  • the target convolutional neural network model is obtained by training a plurality of grayscale images of training cells with expression labels, and the said target convolutional neural network model is obtained by training
  • the expression label is used to characterize the real protein expression of the cells in the grayscale image of each training cell, and the target convolutional neural network model is used to detect the protein expression of the cells in the grayscale image of the cells to be tested of the input model;
  • target cells whose protein expression levels meet the set conditions are determined from the plurality of cells to be tested.
  • the processor when the processor executes the computer program, it also implements the steps in the other embodiments described above.
  • a computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, the following steps are implemented:
  • a plurality of grayscale images of cells to be tested corresponding to a plurality of cells to be tested are input into the target convolutional neural network model;
  • the target convolutional neural network model is obtained by training a plurality of grayscale images of training cells with expression labels, and the said target convolutional neural network model is obtained by training
  • the expression label is used to characterize the real protein expression of the cells in the grayscale image of each training cell, and the target convolutional neural network model is used to detect the protein expression of the cells in the grayscale image of the cells to be tested of the input model;
  • target cells whose protein expression levels meet the set conditions are determined from the plurality of cells to be tested.
  • the computer program when executed by the processor, also implements the steps in the other embodiments described above.
  • Non-volatile memory may include read-only memory (Read-Only Memory, ROM), magnetic tape, floppy disk, flash memory, or optical memory, and the like.
  • Volatile memory may include random access memory (RAM) or external cache memory.
  • RAM may be in various forms, such as Static Random Access Memory (SRAM) or Dynamic Random Access Memory (DRAM), and the like.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Medical Informatics (AREA)
  • Molecular Biology (AREA)
  • Software Systems (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Data Mining & Analysis (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computational Linguistics (AREA)
  • Genetics & Genomics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Library & Information Science (AREA)
  • Biochemistry (AREA)
  • Chemical & Material Sciences (AREA)
  • Bioethics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Databases & Information Systems (AREA)
  • Epidemiology (AREA)
  • Public Health (AREA)
  • Investigating Or Analysing Biological Materials (AREA)
  • Image Analysis (AREA)

Abstract

The present application relates to a convolutional neural network-based cell screening method and device, the method comprising: acquiring gray-scale images of cells to be tested which respectively correspond to multiple cells to be tested in a cell culture pool; inputting the multiple gray-scale images of cells to be tested which correspond to the multiple cells to be tested into a target convolutional neural network model; obtaining the protein expression levels respectively corresponding to the multiple cells to be tested according to the output of the convolutional neural network model; and determining, according to the protein expression levels corresponding to the multiple cells to be tested, from the multiple cells to be tested target cells having protein expression levels satisfying a preset condition. Cells having high protein expression levels can be quickly determined, cell screening can be performed without repeated culturing and screening, and the screening cycle can be significantly shortened.

Description

基于卷积神经网络的细胞筛选方法和装置Cell screening method and device based on convolutional neural network
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请要求于2020年08月26日递交至中国国家知识产权局、申请号为202010869638.4、发明名称为“基于卷积神经网络的细胞筛选方法和装置”的中国专利申请的优先权,其全部内容通过引用合并入本申请中。This application claims the priority of the Chinese patent application with the application number 202010869638.4 and the invention title "Cell Screening Method and Device Based on Convolutional Neural Networks", which was submitted to the State Intellectual Property Office of China on August 26, 2020, and its entire contents Incorporated into this application by reference.
技术领域technical field
本申请涉及生物技术领域,特别是涉及一种基于卷积神经网络的细胞筛选方法、装置、计算机设备和存储介质。The present application relates to the field of biotechnology, in particular to a cell screening method, device, computer equipment and storage medium based on convolutional neural network.
背景技术Background technique
随着基因工程技术的不断发展,从细胞池中分离出能够表达特定产物的单克隆细胞株已成为生物领域中的常见需求。With the continuous development of genetic engineering technology, the isolation of monoclonal cell lines capable of expressing specific products from cell pools has become a common requirement in the biological field.
在现有技术中,在获取用于培养单克隆细胞株的细胞时,可以先对细胞池中的细胞进行转染,并采用有限稀释法对细胞池进行处理,得到单个细胞,进而可以采用单个细胞培养具有同质性的细胞群体,即细胞株,并筛选其中目的蛋白表达量高的细胞株。In the prior art, when obtaining cells for culturing a monoclonal cell line, the cells in the cell pool can be transfected first, and the cell pool can be processed by a limiting dilution method to obtain a single cell, and then a single cell can be obtained. Cells are cultured with homogeneous cell populations, namely cell lines, and the cell lines with high target protein expression are screened.
然而,采用有限稀释法获取单细胞的过程较为繁琐,需要反复地培养和筛选,同时,由于细胞转染效率问题,目的蛋白表达水平高的细胞比例较低,导致筛选细胞筛选工作效率较低,筛选周期长,难以快速、准确地获取具有高目的蛋白表达量的细胞。However, the process of obtaining single cells by the limiting dilution method is cumbersome and requires repeated cultivation and screening. At the same time, due to the problem of cell transfection efficiency, the proportion of cells with high expression levels of the target protein is low, resulting in low efficiency of screening cells. The screening cycle is long, and it is difficult to obtain cells with high target protein expression quickly and accurately.
发明内容SUMMARY OF THE INVENTION
基于此,有必要针对上述技术问题,提供一种基于卷积神经网络的细胞筛选方法、装置、计算机设备和存储介质。Based on this, it is necessary to provide a cell screening method, device, computer equipment and storage medium based on convolutional neural network in view of the above technical problems.
一种基于卷积神经网络的细胞筛选方法,所述方法包括:A cell screening method based on a convolutional neural network, the method comprising:
获取细胞培养池中多个待测细胞分别对应的待测细胞灰度图;Obtain the grayscale images of the cells to be tested corresponding to the plurality of cells to be tested in the cell culture pool;
将多个待测细胞对应的多张待测细胞灰度图输入目标卷积神经网络模型;所述目标卷积神经网络模型根据具有表达量标签的多张训练细胞灰度图训练得到,所述表达量标签用于表征各训练细胞灰度图中细胞的真实蛋白表达量,所述目标卷积神经网络模型用于检测输入模型的待测细胞灰度图中细胞的蛋白表达量;A plurality of grayscale images of cells to be tested corresponding to a plurality of cells to be tested are input into the target convolutional neural network model; the target convolutional neural network model is obtained by training a plurality of grayscale images of training cells with expression labels, and the said target convolutional neural network model is obtained by training The expression label is used to characterize the real protein expression of the cells in the grayscale image of each training cell, and the target convolutional neural network model is used to detect the protein expression of the cells in the grayscale image of the cells to be tested of the input model;
根据所述目标卷积神经网络模型的输出,得到所述多个待测细胞分别对应的蛋白表达量;According to the output of the target convolutional neural network model, obtain the corresponding protein expression levels of the plurality of cells to be tested;
根据多个待测细胞对应的蛋白表达量,从所述多个待测细胞中确定出蛋白表达量满足设定条件的目标细胞。According to the protein expression levels corresponding to the plurality of cells to be tested, target cells whose protein expression levels meet the set conditions are determined from the plurality of cells to be tested.
可选地,还包括:Optionally, also include:
获取训练细胞灰度图及其对应的训练细胞荧光图;Obtain the training cell grayscale image and its corresponding training cell fluorescence image;
根据所述训练细胞荧光图,确定对应的训练细胞灰度图中细胞的真实蛋白表达量,基于所述真实蛋白表达量得到所述训练细胞灰度图对应的表达量标签;According to the training cell fluorescence map, the actual protein expression level of the cells in the corresponding training cell grayscale map is determined, and the expression level label corresponding to the training cell grayscale map is obtained based on the actual protein expression level;
采用所述训练细胞灰度图和所述表达量标签训练卷积神经网络模型,生成目标卷积神经网络模型。The convolutional neural network model is trained by using the training cell grayscale image and the expression label to generate a target convolutional neural network model.
可选地,所述根据所述训练细胞荧光图,确定对应的训练细胞灰度图中细胞的真实蛋白表达量,基于所述真实蛋白表达量得到所述训练细胞灰度图对应的表达量标签,包括:Optionally, according to the training cell fluorescence map, determine the actual protein expression level of the cells in the corresponding training cell grayscale map, and obtain the expression level label corresponding to the training cell grayscale map based on the actual protein expression level. ,include:
确定所述训练细胞荧光图中绿色通道的数值;determining the value of the green channel in the training cell fluorescence image;
根据所述训练细胞荧光图中绿色通道的数值,确定对应的训练细胞灰度图中细胞的真实蛋白表达量;According to the value of the green channel in the fluorescence image of the training cells, determine the actual protein expression of the cells in the corresponding gray-scale image of the training cells;
将所述真实蛋白表达量确定为所述训练细胞灰度图对应的表达量标签。The real protein expression level is determined as the expression level label corresponding to the grayscale image of the training cells.
可选地,所述采用所述训练细胞灰度图和所述表达量标签训练卷积神经网络模型,生成目标卷积神经网络模型,包括:Optionally, the training of a convolutional neural network model using the training cell grayscale image and the expression label to generate a target convolutional neural network model, including:
将所述训练细胞灰度图输入所述卷积神经网络模型,确定所述训练细胞灰度图对应的蛋白表达量;Inputting the training cell grayscale image into the convolutional neural network model, and determining the protein expression corresponding to the training cell grayscale image;
根据所述训练细胞灰度图对应的蛋白表达量和所述表达量标签,确定训练误差;Determine the training error according to the protein expression corresponding to the grayscale image of the training cells and the expression label;
根据所述训练误差,通过减小误差对所述卷积神经网络模型的网络参数进行调整,以得到最优网络参数,并采用所述最优网络参数生成目标卷积神经网络模型。According to the training error, the network parameters of the convolutional neural network model are adjusted by reducing the error to obtain optimal network parameters, and the target convolutional neural network model is generated by using the optimal network parameters.
可选地,所述卷积神经网络模型包括多层结构,所述根据所述训练误差,通过减小误差对所述卷积神经网络模型的网络参数进行调整,以得到最优网络参数,包括:Optionally, the convolutional neural network model includes a multi-layer structure, and according to the training error, the network parameters of the convolutional neural network model are adjusted by reducing the error to obtain optimal network parameters, including: :
判断所述训练误差是否收敛且小于预设误差阈值;judging whether the training error is converged and less than a preset error threshold;
若是,确定当前的卷积神经网络模型的网络参数为最优网络参数;If so, determine the network parameters of the current convolutional neural network model as the optimal network parameters;
若否,采用所述训练误差从所述卷积神经网络模型的最后一层反向传播,通过减小误差对所述卷积神经网络模型各层的网络参数进行调整,并返回将所述训练细胞灰度图输入所述卷积神经网络模型,确定所述训练细胞灰度图对应的蛋白表达量的步骤。If not, use the training error to backpropagate from the last layer of the convolutional neural network model, adjust the network parameters of each layer of the convolutional neural network model by reducing the error, and return the training The cell grayscale image is input into the convolutional neural network model, and the step of determining the protein expression level corresponding to the training cell grayscale image.
可选地,所述卷积神经网络模型包括第一网络结构、第二网络结构、第三网络结构、第四网络结构和全连接层;所述训练细胞灰度图为多张训练细胞灰度图;Optionally, the convolutional neural network model includes a first network structure, a second network structure, a third network structure, a fourth network structure and a fully connected layer; the training cell grayscale map is a plurality of training cell grayscales. picture;
所述将所述训练细胞灰度图输入所述卷积神经网络模型,确定所述训练细胞灰度图对应的蛋白表达量,包括:Inputting the training cell grayscale image into the convolutional neural network model, and determining the protein expression corresponding to the training cell grayscale image, including:
将多张训练细胞灰度图输入所述卷积神经网络模型;inputting multiple grayscale images of training cells into the convolutional neural network model;
针对每张训练细胞灰度图,通过所述第一网络结构获取对应的训练细胞特征;将所述训练细胞特征输入所述第二网络结构、第三网络结构和第四网络结构,得到对应的第一细胞特征、第二细胞特征和第三细胞特征;For each training cell grayscale image, obtain the corresponding training cell feature through the first network structure; input the training cell feature into the second network structure, the third network structure and the fourth network structure to obtain the corresponding a first cell feature, a second cell feature, and a third cell feature;
对所述第一细胞特征、第二细胞特征和第三细胞特征进行并行连接,得到每张训练细胞灰度图对应的特征融合结果;其中,所述第一细胞特征、第二细胞特征和第三细胞特征具有不同的抽象表达层次;The first cell feature, the second cell feature and the third cell feature are connected in parallel to obtain the feature fusion result corresponding to each training cell grayscale image; wherein the first cell feature, the second cell feature and the third cell feature are Three-cell features have different levels of abstract expression;
将多张训练细胞灰度图分别对应的特征融合结果输入至所述全连接层,根据所述全连接 层的输出结果确定多张训练细胞灰度图分别对应的蛋白表达量。The feature fusion results corresponding to the grayscale images of the training cells are input to the fully connected layer, and the protein expression levels corresponding to the grayscale images of the training cells are determined according to the output results of the fully connected layer.
可选地,所述根据多个待测细胞对应的蛋白表达量,从所述多个待测细胞中确定出蛋白表达量满足设定条件的目标细胞,包括:Optionally, according to the protein expression levels corresponding to the plurality of cells to be tested, the target cells whose protein expression levels meet the set conditions are determined from the plurality of cells to be tested, including:
对多个待测细胞对应的蛋白表达量进行排序,并从排序后的多个蛋白表达量中,将排序最前的预设数量的蛋白表达量确定为目标表达量;Sorting the protein expression levels corresponding to the plurality of cells to be tested, and determining the pre-set number of protein expression levels at the top of the sorting as the target expression level from the multiple protein expression levels after sorting;
确定所述目标表达量对应的待测细胞灰度图,并将所述待测细胞灰度图对应的待测细胞确定为目标细胞。The grayscale image of the cell to be tested corresponding to the target expression level is determined, and the cell to be tested corresponding to the grayscale image of the cell to be tested is determined as the target cell.
一种基于卷积神经网络的细胞筛选装置,所述装置包括:A cell screening device based on a convolutional neural network, the device comprising:
待测细胞灰度图获取模块,用于获取细胞培养池中多个待测细胞分别对应的待测细胞灰度图;The grayscale image acquisition module of the cells to be tested is used to acquire the grayscale images of the cells to be tested corresponding to a plurality of cells to be tested in the cell culture tank;
第一输入模块,用于将多个待测细胞对应的多张待测细胞灰度图输入目标卷积神经网络模型;所述目标卷积神经网络模型根据具有表达量标签的多张训练细胞灰度图训练得到,所述表达量标签用于表征各训练细胞灰度图中细胞的真实蛋白表达量,所述目标卷积神经网络模型用于检测输入模型的待测细胞灰度图中细胞的蛋白表达量;The first input module is used to input multiple grayscale images of cells to be tested corresponding to multiple cells to be tested into the target convolutional neural network model; the target convolutional neural network model The degree map training is obtained, the expression label is used to represent the real protein expression of the cells in the grayscale map of each training cell, and the target convolutional neural network model is used to detect the input model. protein expression;
蛋白表达量预测模块,用于根据所述目标卷积神经网络模型的输出,得到所述多个待测细胞分别对应的蛋白表达量;A protein expression level prediction module, configured to obtain the respective protein expression levels corresponding to the plurality of cells to be tested according to the output of the target convolutional neural network model;
细胞筛选模块,用于根据多个待测细胞对应的蛋白表达量,从所述多个待测细胞中确定出蛋白表达量满足设定条件的目标细胞。The cell screening module is used to determine, from the plurality of cells to be tested, the target cells whose protein expression meets the set condition according to the protein expression levels corresponding to the plurality of cells to be tested.
一种计算机设备,包括存储器和处理器,所述存储器存储有计算机程序,所述处理器执行所述计算机程序时实现上述所述的基于卷积神经网络的细胞筛选方法的步骤:A computer device, comprising a memory and a processor, wherein the memory stores a computer program, and when the processor executes the computer program, the steps of the above-mentioned cell screening method based on a convolutional neural network are implemented:
一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现上述所述的基于卷积神经网络的细胞筛选方法的步骤:A computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, the steps of the above-mentioned cell screening method based on a convolutional neural network are realized:
上述一种基于卷积神经网络的细胞筛选方法、装置、计算机设备和存储介质,通过获取细胞培养池中多个待测细胞分别对应的待测细胞灰度图,将多个待测细胞对应的多张待测细胞灰度图输入目标卷积神经网络模型,并根据目标卷积神经网络模型的输出,得到多个待测细胞分别对应的蛋白表达量,进而根据多个待测细胞对应的蛋白表达量,从多个待测细胞中确定出蛋白表达量满足设定条件的目标细胞,实现了快速确定具有高蛋白表达量的细胞,避免需要经过反复培养和筛选后才能进行细胞筛选,大大缩短的筛选周期,并且,通过本申请可以快速处理上百万的单细胞,在增加细胞筛选范围的同时,减少了工作人员的工作量,有效提升细胞筛选效率。The above-mentioned cell screening method, device, computer equipment and storage medium based on convolutional neural network, by acquiring the grayscale images of the cells to be tested corresponding to the plurality of cells to be tested in the cell culture tank, and the corresponding cells of the plurality of cells to be tested. Multiple grayscale images of the cells to be tested are input into the target convolutional neural network model, and according to the output of the target convolutional neural network model, the protein expression levels corresponding to the plurality of cells to be tested are obtained, and then the protein expression levels corresponding to the plurality of cells to be tested are obtained. Expression level, the target cells whose protein expression levels meet the set conditions are determined from multiple cells to be tested, which realizes the rapid determination of cells with high protein expression levels, and avoids the need to undergo repeated culture and screening before cell screening can be performed. In addition, the application can quickly process millions of single cells, while increasing the range of cell screening, reducing the workload of staff and effectively improving the efficiency of cell screening.
附图说明Description of drawings
图1为一个实施例中一种基于卷积神经网络的细胞筛选方法的流程示意图;1 is a schematic flowchart of a cell screening method based on a convolutional neural network in one embodiment;
图2为一个实施例中一种模型训练的步骤的流程示意图;Fig. 2 is a schematic flowchart of steps of model training in one embodiment;
图3a为一个实施例中一种训练细胞灰度图;Figure 3a is a grayscale image of a training cell in one embodiment;
图3b为一个实施例中一种训练细胞荧光图;Figure 3b is a fluorescence image of a training cell in one embodiment;
图4为一个实施例中另一种模型训练的步骤的流程示意图;4 is a schematic flowchart of steps of another model training in one embodiment;
图5为一个实施例中一种模型参数调整的步骤的流程示意图;5 is a schematic flowchart of a step of model parameter adjustment in one embodiment;
图6为一个实施例中一种预测蛋白表达量的步骤的流程示意图;6 is a schematic flowchart of a step of predicting protein expression in one embodiment;
图7为一个实施例中一种基于卷积神经网络的细胞筛选装置的结构框图;7 is a structural block diagram of a cell screening device based on a convolutional neural network in one embodiment;
图8为一个实施例中计算机设备的内部结构图。FIG. 8 is a diagram of the internal structure of a computer device in one embodiment.
具体实施方式detailed description
为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。In order to make the purpose, technical solutions and advantages of the present application more clearly understood, the present application will be described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present application, but not to limit the present application.
为了便于对本发明实施例的理解,先对现有技术进行说明。In order to facilitate the understanding of the embodiments of the present invention, the prior art is first described.
在现有技术中,在获取用于培养单克隆细胞株的细胞时,可以先对细胞池中的细胞进行转染,并采用有限稀释法对细胞池进行处理,得到单个细胞,进而可以采用单个细胞培养具有同质性的细胞群体,即细胞株,并筛选其中目的蛋白表达量高的细胞株。In the prior art, when obtaining cells for culturing a monoclonal cell line, the cells in the cell pool can be transfected first, and the cell pool can be processed by a limiting dilution method to obtain a single cell, and then a single cell can be obtained. Cells are cultured with homogeneous cell populations, namely cell lines, and the cell lines with high target protein expression are screened.
然而,采用有限稀释法获取单细胞的过程较为繁琐,需要反复地培养和筛选,同时,由于细胞转染效率问题,目的蛋白表达水平高的细胞比例较低,导致筛选细胞筛选工作效率较低,筛选周期长,传统方法往往需要耗时6个月甚至更多,在耗费大量人力物力支持的同时,难以满足规模化、产业化的需求。However, the process of obtaining single cells by the limiting dilution method is cumbersome and requires repeated cultivation and screening. At the same time, due to the problem of cell transfection efficiency, the proportion of cells with high expression levels of the target protein is low, resulting in low efficiency of screening cells. The screening cycle is long, and traditional methods often take 6 months or more. While consuming a lot of human and material resources, it is difficult to meet the needs of scale and industrialization.
在一个实施例中,如图1所示,提供了一种基于卷积神经网络的细胞筛选方法,本实施例以该方法应用于终端进行举例说明,可以理解的是,该方法也可以应用于服务器,还可以应用于包括终端和服务器的系统,并通过终端和服务器的交互实现该方法。本实施例中,该方法包括以下步骤:In one embodiment, as shown in FIG. 1 , a cell screening method based on a convolutional neural network is provided. In this embodiment, the method is applied to a terminal for illustration. It can be understood that the method can also be applied to The server can also be applied to a system including a terminal and a server, and the method is implemented through the interaction between the terminal and the server. In this embodiment, the method includes the following steps:
步骤101,获取细胞培养池中多个待测细胞分别对应的待测细胞灰度图。Step 101: Obtain grayscale images of cells to be tested corresponding to a plurality of cells to be tested in the cell culture tank.
作为一示例,待测细胞可以是经过转染技术处理后的细胞,待测细胞可以是在转染技术处理后未能获得外源DNA片段的细胞,也可以是已获得外源DNA片段但未整合到染色体中的细胞,或者是外源DNA片段已整合到染色体中的细胞。待测细胞灰度图是待测细胞的灰度图片。As an example, the cells to be tested may be cells treated with transfection technology, the cells to be tested may be cells that fail to obtain exogenous DNA fragments after treatment with transfection technology, or cells that have obtained exogenous DNA fragments but not A cell that has been integrated into a chromosome, or a cell in which a foreign DNA segment has been integrated into a chromosome. The grayscale image of the cells to be tested is a grayscale image of the cells to be tested.
在实际应用中,可以对细胞培养池中的多个细胞进行转染,使得细胞培养池中的部分或全部细胞可以获得外源DNA片段。在进行转染技术处理后,可以获取细胞培养池中多个待测细胞分别对应的待测细胞灰度图。In practical applications, multiple cells in the cell culture pool can be transfected, so that some or all of the cells in the cell culture pool can obtain exogenous DNA fragments. After the transfection technique is performed, grayscale images of the cells to be tested corresponding to the plurality of cells to be tested in the cell culture tank can be obtained.
步骤102,将多个待测细胞对应的多张待测细胞灰度图输入目标卷积神经网络模型;所述目标卷积神经网络模型根据具有表达量标签的多张训练细胞灰度图训练得到,所述表达量标签用于表征各训练细胞灰度图中细胞的真实蛋白表达量,所述目标卷积神经网络模型用于检测输入目标卷积神经网络模型的待测细胞灰度图中细胞的蛋白表达量。Step 102: Inputting multiple grayscale images of cells to be tested corresponding to multiple cells to be tested into a target convolutional neural network model; the target convolutional neural network model is obtained by training a plurality of grayscale images of training cells with expression labels. , the expression label is used to represent the real protein expression of the cells in the grayscale image of each training cell, and the target convolutional neural network model is used to detect the cells in the grayscale image of the cells to be tested that are input to the target convolutional neural network model. protein expression.
作为一示例,训练细胞灰度图是作为训练样本的图片,其用于训练卷积神经网络模型, 训练细胞灰度图中的细胞为经过转染处理后的细胞。As an example, the grayscale image of the training cells is a picture used as a training sample, which is used to train a convolutional neural network model, and the cells in the grayscale image of the training cells are cells that have been transfected.
在实际应用中,可以采用具有表达量标签的多张训练细胞灰度图对卷积神经网络模型进行训练,得到目标卷积神经网络模型。其中,训练细胞灰度图对应的表达量标签,可以表征训练细胞灰度图中细胞的真实蛋白表达量,由于基因主要的表达产物是蛋白质,通过真实蛋白表达量,可以确定训练细胞灰度图中细胞对目的基因的表达情况。In practical applications, the convolutional neural network model can be trained by using multiple grayscale images of training cells with expression labels to obtain the target convolutional neural network model. Among them, the expression label corresponding to the grayscale image of the training cell can represent the real protein expression of the cell in the grayscale image of the training cell. Since the main expression product of the gene is protein, the grayscale image of the training cell can be determined through the actual protein expression. expression of target genes in cells.
在得到多个待测细胞分别对应的待测细胞灰度图后,可以将多张待测细胞灰度图输入经过训练后的目标卷积神经网络模型,通过目标卷积神经网络模型,可以检测待测细胞灰度图中细胞的蛋白表达量。After obtaining the grayscale images of the cells to be tested corresponding to the cells to be tested, the grayscale images of the cells to be tested can be input into the trained target convolutional neural network model, and the target convolutional neural network model can be used to detect The protein expression of the cells in the grayscale image of the cells to be tested.
步骤103,根据所述目标卷积神经网络模型的输出,得到所述多个待测细胞分别对应的蛋白表达量。 Step 103 , according to the output of the target convolutional neural network model, obtain the respective protein expression levels corresponding to the plurality of cells to be tested.
作为一示例,待测细胞对应的蛋白表达量为待测细胞灰度图中细胞的蛋白表达量,该蛋白表达量是通过目标卷积神经网络模型预测得到的。As an example, the protein expression level corresponding to the cells to be tested is the protein expression level of the cells in the grayscale image of the cells to be tested, and the protein expression level is predicted by the target convolutional neural network model.
在将多张待测细胞灰度图输入目标卷积神经网络模型后,可以根据目标卷积神经网络模型的输出,获取多个待测细胞分别对应的蛋白表达量。After inputting the grayscale images of the cells to be tested into the target convolutional neural network model, the protein expression levels corresponding to the cells to be tested can be obtained according to the output of the target convolutional neural network model.
步骤104,根据多个待测细胞对应的蛋白表达量,从所述多个待测细胞中确定出蛋白表达量满足设定条件的目标细胞。 Step 104 , according to the protein expression levels corresponding to the plurality of cells to be tested, determine, from the plurality of cells to be tested, target cells whose protein expression meets the set condition.
在得到多个待测细胞对应的蛋白表达量后,可以对多个待测细胞进行筛选,从多个待测细胞中确定出蛋白表达量满足设定条件的目标细胞。After the protein expression levels corresponding to the plurality of cells to be tested are obtained, the plurality of cells to be tested can be screened, and the target cells whose protein expression levels meet the set conditions are determined from the plurality of cells to be tested.
在本实施例中,通过获取细胞培养池中多个待测细胞分别对应的待测细胞灰度图,将多个待测细胞对应的多张待测细胞灰度图输入目标卷积神经网络模型,并根据目标卷积神经网络模型的输出,得到多个待测细胞分别对应的蛋白表达量,进而根据多个待测细胞对应的蛋白表达量,从多个待测细胞中确定出蛋白表达量满足设定条件的目标细胞,实现了快速确定具有高蛋白表达量的细胞,避免需要经过反复培养和筛选后才能进行细胞筛选,大大缩短的筛选周期,并且,通过本申请可以快速处理上百万的单细胞,在增加细胞筛选范围的同时,减少了工作人员的工作量,有效提升细胞筛选效率。In this embodiment, by acquiring the grayscale images of the cells to be tested corresponding to the cells to be tested in the cell culture tank, the grayscale images of the cells to be tested corresponding to the cells to be tested are input into the target convolutional neural network model , and according to the output of the target convolutional neural network model, the protein expression levels corresponding to the multiple cells to be tested are obtained, and then the protein expression levels are determined from the multiple cells to be tested according to the protein expression levels corresponding to the multiple cells to be tested. Target cells that meet the set conditions can quickly determine cells with high protein expression, avoid the need for repeated culture and screening before cell screening, greatly shorten the screening cycle, and quickly process millions of cells through this application The single cell can increase the range of cell screening, reduce the workload of the staff, and effectively improve the efficiency of cell screening.
在一个实施例中,如图2所示,基于卷积神经网络的细胞筛选方法还可以包括如下步骤:In one embodiment, as shown in FIG. 2 , the cell screening method based on convolutional neural network may further include the following steps:
步骤201,获取训练细胞灰度图及其对应的训练细胞荧光图。 Step 201 , acquiring a training cell grayscale image and its corresponding training cell fluorescence image.
在具体实现中,可以设置作为训练集的细胞,并对该细胞进行拍摄,分别获取训练细胞灰度图和对应的训练细胞荧光图。其中,训练集的细胞可用于训练卷积神经网络模型,训练细胞灰度图和对应的训练细胞荧光图,可以是在相同的拍摄条件下,针对相同的细胞进行拍摄所得到的灰度图和荧光图。In a specific implementation, a cell as a training set can be set, and the cell can be photographed to obtain a grayscale image of the training cell and a corresponding fluorescence image of the training cell, respectively. Among them, the cells in the training set can be used to train the convolutional neural network model, and the grayscale image of the training cells and the corresponding fluorescence image of the training cells can be the grayscale images obtained by shooting the same cells under the same shooting conditions and Fluorescence map.
作为训练集的细胞,可以是细胞培养池中经过转染技术处理后的细胞,该细胞可以是在转染技术处理后未能获得外源DNA片段的细胞,也可以是已获得外源DNA片段但未整合到染色体中的细胞,或者是外源DNA片段已整合到染色体中的细胞。The cells in the training set can be cells in the cell culture pool that have been processed by transfection technology. The cells can be cells that have not been able to obtain exogenous DNA fragments after the transfection technology treatment, or cells that have obtained exogenous DNA fragments. But cells that are not integrated into chromosomes, or cells that have foreign DNA fragments integrated into chromosomes.
针对细胞培养池中经过转染技术处理的同一批细胞,可以在相同的拍摄条件下,使用显微镜同时拍摄灰度图和荧光图,得到的灰度图和荧光图中可以包括一个或多个细胞,灰度图 中每个细胞的坐标与荧光图中该细胞的坐标对应。由于同一灰度图和荧光图中可以同时包含多个细胞,在得到灰度图和荧光图后,可以对灰度图和荧光图进行分割,得到单细胞对应的训练细胞灰度图和训练细胞荧光图,如图3a和图3b所示。For the same batch of cells in the cell culture pool treated with transfection technology, the grayscale image and the fluorescence image can be captured simultaneously with a microscope under the same shooting conditions, and the obtained grayscale image and fluorescence image can include one or more cells , the coordinates of each cell in the grayscale image correspond to the coordinates of that cell in the fluorescence image. Since the same grayscale image and fluorescence image can contain multiple cells at the same time, after obtaining the grayscale image and the fluorescence image, the grayscale image and the fluorescence image can be segmented to obtain the training cell grayscale image and training cell corresponding to a single cell. The fluorescence images are shown in Fig. 3a and Fig. 3b.
步骤202,根据所述训练细胞荧光图,确定对应的训练细胞灰度图中细胞的真实蛋白表达量,基于所述真实蛋白表达量得到所述训练细胞灰度图对应的表达量标签。 Step 202 , according to the training cell fluorescence map, determine the actual protein expression level of the cells in the corresponding training cell grayscale map, and obtain an expression level label corresponding to the training cell grayscale map based on the actual protein expression level.
在获取训练细胞荧光图后,可以根据训练细胞荧光图,确定对应的训练细胞灰度图中细胞的真实蛋白表达量,并基于真实蛋白表达量得到训练细胞灰度图对应的表达量标签。After acquiring the training cell fluorescence map, the actual protein expression level of the cells in the corresponding training cell grayscale map can be determined according to the training cell fluorescence map, and the expression level label corresponding to the training cell grayscale map can be obtained based on the real protein expression level.
步骤203,采用所述训练细胞灰度图和所述表达量标签训练卷积神经网络模型,生成目标卷积神经网络模型。 Step 203 , using the training cell grayscale image and the expression label to train a convolutional neural network model to generate a target convolutional neural network model.
在得到表达量标签后,可以采用训练细胞灰度图和对应的表达量标签,对卷积神经网络模型进行训练,生成目标卷积神经网络模型。After the expression label is obtained, the training cell grayscale image and the corresponding expression label can be used to train the convolutional neural network model to generate the target convolutional neural network model.
在本实施例中,通过采用训练细胞灰度图和表达量标签训练卷积神经网络模型,生成目标卷积神经网络模型,可以建立细胞灰度图以及细胞灰度图中细胞的蛋白表达量两者之间的关系,为快速筛选高蛋白表达量的细胞提供模型支撑。In this embodiment, by using the training cell grayscale map and the expression label to train the convolutional neural network model to generate the target convolutional neural network model, the cell grayscale map and the cell protein expression level in the cell grayscale map can be established. The relationship between them provides model support for the rapid screening of cells with high protein expression.
在一个实施例中,所述根据所述训练细胞荧光图,确定对应的训练细胞灰度图中细胞的真实蛋白表达量,基于所述真实蛋白表达量得到所述训练细胞灰度图对应的表达量标签,可以包括如下步骤:In one embodiment, the actual protein expression of cells in the corresponding training cell grayscale image is determined according to the training cell fluorescence image, and the expression corresponding to the training cell grayscale image is obtained based on the actual protein expression. quantity label, which can include the following steps:
确定所述训练细胞荧光图中绿色通道的数值;根据所述训练细胞荧光图中绿色通道的数值,通过累加荧光图中细胞区域的绿色通道数值,确定对应的训练细胞灰度图中细胞的真实蛋白表达量;将所述真实蛋白表达量确定为所述训练细胞灰度图对应的表达量标签。Determine the value of the green channel in the fluorescence image of the training cells; according to the value of the green channel in the fluorescence image of the training cells, by accumulating the value of the green channel in the cell area in the fluorescence image, determine the true value of the cell in the grayscale image of the corresponding training cell Protein expression level; the real protein expression level is determined as the expression level label corresponding to the grayscale image of the training cells.
在实际应用中,通过目的基因(例如外源DNA片段)生成的蛋白质,可以在特定波长下发出荧光。在得到训练细胞荧光图后,可以确定训练细胞荧光图中绿色通道对应的数值(也可以称为荧光值),并根据该数值确定对应的训练细胞灰度图中细胞的真实蛋白表达量,进而可以将真实蛋白表达量确定为训练细胞灰度图对应的表达量标签。其中,荧光值与蛋白表达量之间可以为正相关的关系,通过获取荧光值与蛋白表达量之间的数量映射关系,可以通过绿色亮度值确定真实蛋白表达量。In practical applications, proteins produced by genes of interest, such as exogenous DNA fragments, can fluoresce at specific wavelengths. After the training cell fluorescence map is obtained, the value corresponding to the green channel in the training cell fluorescence map (also called the fluorescence value) can be determined, and the real protein expression of the cells in the corresponding training cell grayscale image can be determined according to the value, and then The real protein expression can be determined as the expression label corresponding to the grayscale image of the training cells. Among them, the relationship between the fluorescence value and the protein expression amount can be a positive correlation, and by obtaining the quantitative mapping relationship between the fluorescence value and the protein expression amount, the real protein expression amount can be determined by the green brightness value.
在本实施例中,通过根据训练细胞荧光图中绿色通道对应的数值,确定对应的训练细胞灰度图中细胞的真实蛋白表达量,并将真实蛋白表达量确定为训练细胞灰度图对应的表达量标签,能够以训练细胞荧光图绿色通道对应的数值作为中间变量,训练细胞灰度图中细胞的蛋白表达量进行量化,得到训练细胞灰度图对应的表达量标签,提供准确的真实蛋白表达量数据。In this embodiment, according to the value corresponding to the green channel in the training cell fluorescence image, the actual protein expression level of the cells in the corresponding training cell grayscale image is determined, and the actual protein expression level is determined as the corresponding training cell grayscale image. The expression label can use the value corresponding to the green channel of the training cell fluorescence image as an intermediate variable to quantify the protein expression of the cells in the grayscale image of the training cell, and obtain the expression label corresponding to the grayscale image of the training cell, providing accurate real protein Expression data.
在一个实施例中,如图4所示,所述采用所述训练细胞灰度图和所述表达量标签训练卷积神经网络模型,生成目标卷积神经网络模型,可以包括如下步骤:In one embodiment, as shown in FIG. 4 , the training of a convolutional neural network model by using the training cell grayscale image and the expression label to generate a target convolutional neural network model may include the following steps:
步骤401,将所述训练细胞灰度图输入所述卷积神经网络模型,确定所述训练细胞灰度图对应的蛋白表达量。Step 401: Input the grayscale image of the training cells into the convolutional neural network model, and determine the protein expression level corresponding to the grayscale image of the training cells.
在实际应用中,在对卷积神经网络模型进行训练时,可以将训练细胞灰度图输入至卷积 神经网络模型,并通过卷积神经网络模型的输出结果,确定训练细胞灰度图对应的蛋白表达量,其中,卷积神经网络模型用于对训练细胞灰度图中细胞的蛋白表达量进行预测,在预测蛋白表达量时,卷积神经网络模型是以训练细胞灰度图作为输入,预测对应的蛋白表达量。In practical applications, when training the convolutional neural network model, the training cell grayscale image can be input into the convolutional neural network model, and the output result of the convolutional neural network model can be used to determine the corresponding grayscale image of the training cell. The protein expression level, where the convolutional neural network model is used to predict the protein expression level of the cells in the grayscale image of the training cells. When predicting the protein expression level, the convolutional neural network model takes the grayscale image of the training cells as input. Predict the corresponding protein expression level.
步骤402,根据所述训练细胞灰度图对应的蛋白表达量和所述表达量标签,确定训练误差。Step 402: Determine the training error according to the protein expression level corresponding to the grayscale image of the training cells and the expression level label.
在获取训练细胞灰度图对应的蛋白表达量后,由于训练细胞灰度图对应的蛋白表达量是卷积神经网络模型预测的蛋白表达量,在训练过程的初期,该预测的蛋白表达量与真实的蛋白表达量存在差距,基于此,可以通过训练细胞灰度图对应的蛋白表达量和表达量标签,确定两者之间的训练误差。在实际应用中,可以通过代价函数,计算训练细胞灰度图对应的蛋白表达量与表达量标签之间的训练误差。After obtaining the protein expression level corresponding to the grayscale image of the training cells, since the protein expression level corresponding to the grayscale image of the training cell is the protein expression level predicted by the convolutional neural network model, at the beginning of the training process, the predicted protein expression level is the same as the protein expression level predicted by the convolutional neural network model. There is a gap between the real protein expression levels. Based on this, the training error between the two can be determined by training the protein expression level and expression level label corresponding to the grayscale image of the training cells. In practical applications, the cost function can be used to calculate the training error between the protein expression level corresponding to the grayscale image of the training cell and the expression level label.
步骤403,根据所述训练误差,通过减小误差对所述卷积神经网络模型的网络参数进行调整,以得到最优网络参数,并采用所述最优网络参数生成目标卷积神经网络模型。 Step 403 , according to the training error, adjust the network parameters of the convolutional neural network model by reducing the error to obtain optimal network parameters, and use the optimal network parameters to generate a target convolutional neural network model.
在确定训练误差后,可以根据训练误差,按照减小误差的调整目的对卷积神经网络模型的网络参数进行调整,直到得到最优网络参数。在确定最优网络参数后,可以基于该最优网络参数生成目标卷积神经网络模型。After the training error is determined, the network parameters of the convolutional neural network model can be adjusted according to the training error and the adjustment purpose of reducing the error until the optimal network parameters are obtained. After the optimal network parameters are determined, the target convolutional neural network model can be generated based on the optimal network parameters.
在本实施例中,通过训练细胞灰度图对应的蛋白表达量和表达量标签确定训练误差,根据训练误差对卷积神经网络模型的网络参数进行调整,以得到最优网络参数,并采用最优网络参数生成目标卷积神经网络模型,可以通过预测的蛋白表达量与真实蛋白表达量之间的差距,不断地对卷积神经网络模型进行训练、优化。In this embodiment, the training error is determined by the protein expression level and the expression level label corresponding to the grayscale image of the training cells, and the network parameters of the convolutional neural network model are adjusted according to the training error to obtain the optimal network parameters, and the optimal network parameters are adopted. The optimal network parameters generate the target convolutional neural network model, which can continuously train and optimize the convolutional neural network model through the gap between the predicted protein expression and the actual protein expression.
在一个实施例中,所述卷积神经网络模型包括多层结构,所述根据所述训练误差,通过减小误差对所述卷积神经网络模型的网络参数进行调整,以得到最优网络参数,可以包括如下步骤:In one embodiment, the convolutional neural network model includes a multi-layer structure, and according to the training error, the network parameters of the convolutional neural network model are adjusted by reducing the error to obtain optimal network parameters , which can include the following steps:
判断所述训练误差是否收敛且小于预设误差阈值;若是,确定当前的卷积神经网络模型的网络参数为最优网络参数;若否,采用所述训练误差从所述卷积神经网络模型的最后一层反向传播,通过减小误差对所述卷积神经网络模型各层的网络参数进行调整,并返回将所述训练细胞灰度图输入所述卷积神经网络模型,确定所述训练细胞灰度图对应的蛋白表达量的步骤。Determine whether the training error converges and is smaller than the preset error threshold; if so, determine that the network parameters of the current convolutional neural network model are the optimal network parameters; if not, use the training error from the convolutional neural network model The last layer of backpropagation adjusts the network parameters of each layer of the convolutional neural network model by reducing the error, and returns to input the grayscale image of the training cells into the convolutional neural network model to determine the training Steps of protein expression levels corresponding to cell grayscale images.
在实际应用中,卷积神经网络模型可以包括多层结构,例如设置最大池化层、平均池化层和多层卷积层。在训练卷积神经网络模型时,判断训练误差是否收敛且小于预设误差阈值。In practical applications, convolutional neural network models can include multi-layer structures, such as setting max pooling layers, average pooling layers, and multi-layer convolutional layers. When training the convolutional neural network model, it is determined whether the training error converges and is smaller than the preset error threshold.
若是,确定训练细胞灰度图对应的蛋白表达量与真实蛋白表达量接近,可以将当前的卷积神经网络模型的网络参数确定为最优网络参数;若否,可以将训练误差从卷积神经网络模型的最后一层反向传播,按照减小误差的调整方向对卷积神经网络模型中各层的网络参数进行调整,在调整后,可以返回将训练细胞灰度图输入卷积神经网络模型,确定训练细胞灰度图对应的蛋白表达量的步骤。If yes, it is determined that the protein expression corresponding to the grayscale image of the training cells is close to the real protein expression, and the network parameters of the current convolutional neural network model can be determined as the optimal network parameters; The last layer of the network model is back-propagated, and the network parameters of each layer in the convolutional neural network model are adjusted according to the adjustment direction of reducing the error. After adjustment, the grayscale image of the training cells can be returned to the convolutional neural network model. , the steps of determining the protein expression level corresponding to the grayscale image of the training cells.
为了使本领域技术人员能够更好地理解上述步骤,以下通过一个例子对本申请实施例加以示例性说明,但应当理解的是,本申请实施例并不限于此。In order to enable those skilled in the art to better understand the above steps, an example is used below to illustrate the embodiment of the present application, but it should be understood that the embodiment of the present application is not limited thereto.
如图5所示,可以获取训练细胞灰度图和训练荧光图,在对卷积神经网络模型中各层的网络参数进行初始化后,可以将训练细胞灰度图输入卷积神经网络模型,前向传播得到输出值(即本申请中训练细胞灰度图对应的蛋白表达量),并计算输出值与真实值(即本申请中的表达量标签)的训练误差。As shown in Figure 5, the training cell grayscale image and the training fluorescence image can be obtained. After initializing the network parameters of each layer in the convolutional neural network model, the training cell grayscale image can be input into the convolutional neural network model. The output value (that is, the protein expression corresponding to the grayscale image of the training cells in this application) is obtained through propagation, and the training error between the output value and the real value (that is, the expression label in this application) is calculated.
在得到训练误差后,可以判断训练误差是否收敛且足够小,若是,可以将训练误差反向传播,采用SGD(随机梯度下降)算法,或者其他优化算法更新各层的连接权重和偏置(即本申请中的网络参数),并重新通过前向传播,得到输出值;若否,可以确定当前的网络参数为最优网络参数,并基于最优网络参数生成目标卷积神经网络模型。After the training error is obtained, it can be judged whether the training error is converged and small enough. If so, the training error can be back-propagated, and the SGD (Stochastic Gradient Descent) algorithm or other optimization algorithm can be used to update the connection weights and biases of each layer (ie network parameters in this application), and pass forward propagation again to obtain the output value; if not, the current network parameters can be determined as the optimal network parameters, and the target convolutional neural network model is generated based on the optimal network parameters.
在实际应用中,细胞培养池中经过转染处理的细胞,可以划分为训练集、验证集和测试集,其中训练集的细胞对应训练细胞灰度图可用于训练卷积神经网络模型;验证集的细胞对应的细胞灰度图可以用于验证训练好的目标卷积神经网络模型,防止模型在训练集上过拟合,可通过验证集来确定训练过程中模型的准确程度;测试集的细胞,可以是本申请中的待测细胞,通过目标卷积神经网络模型,可以从多个待测细胞中确定蛋白表达量满足设定条件的目标细胞。In practical applications, the transfected cells in the cell culture pool can be divided into a training set, a validation set and a test set, in which the grayscale images of the cells in the training set corresponding to the training cells can be used to train the convolutional neural network model; the validation set The grayscale image of the cells corresponding to the cells can be used to verify the trained target convolutional neural network model to prevent the model from overfitting on the training set, and the accuracy of the model during the training process can be determined by the validation set; , which can be the cells to be tested in the present application. Through the target convolutional neural network model, the target cells whose protein expression meets the set conditions can be determined from a plurality of cells to be tested.
在本实施例中,判断训练误差是否收敛,若否,可以采用训练误差从卷积神经网络模型的最后一层反向传播,通过减小误差对卷积神经网络模型各层的网络参数进行调整,能够通过迭代计算对网络参数不断进行优化,直到预测的蛋白表达量接近真实蛋白表达量,提高目标卷积神经网络模型对蛋白表达量的预测准确性。In this embodiment, it is judged whether the training error has converged. If not, the training error can be back-propagated from the last layer of the convolutional neural network model, and the network parameters of each layer of the convolutional neural network model can be adjusted by reducing the error. , the network parameters can be continuously optimized through iterative calculation until the predicted protein expression is close to the real protein expression, which improves the prediction accuracy of the target convolutional neural network model for the protein expression.
在一个实施例中,卷积神经网络模型包括第一网络结构、第二网络结构、第三网络结构、第四网络结构和全连接层,训练细胞灰度图可以是多张训练细胞灰度图。In one embodiment, the convolutional neural network model includes a first network structure, a second network structure, a third network structure, a fourth network structure and a fully connected layer, and the training cell grayscale image may be a plurality of training cell grayscale images .
如图6所示,所述将所述训练细胞灰度图输入所述卷积神经网络模型,确定所述训练细胞灰度图对应的蛋白表达量,可以包括如下步骤:As shown in FIG. 6 , inputting the training cell grayscale image into the convolutional neural network model, and determining the protein expression level corresponding to the training cell grayscale image, may include the following steps:
步骤601,将多张训练细胞灰度图输入所述卷积神经网络模型。Step 601: Inputting multiple grayscale images of training cells into the convolutional neural network model.
在具体实现中,可以将多张训练细胞灰度图输入至卷积神经网络模型。In a specific implementation, multiple grayscale images of training cells can be input into the convolutional neural network model.
步骤602,针对每张训练细胞灰度图,通过所述第一网络结构获取对应的训练细胞特征;将所述训练细胞特征输入所述第二网络结构、第三网络结构和第四网络结构,得到对应的第一细胞特征、第二细胞特征和第三细胞特征。 Step 602, for each training cell grayscale image, obtain the corresponding training cell feature through the first network structure; input the training cell feature into the second network structure, the third network structure and the fourth network structure, The corresponding first cell feature, second cell feature and third cell feature are obtained.
针对每张训练细胞灰度图,可以通过第一网络结构获取训练细胞灰度图对应的训练细胞特征,并将训练细胞特征输入至第二网络结构、第三网络结构和第四网络结构,得到对应的第一细胞特征、第二细胞特征和第三细胞特征。For each training cell grayscale map, the training cell features corresponding to the training cell grayscale map can be obtained through the first network structure, and the training cell features can be input into the second network structure, the third network structure, and the fourth network structure to obtain Corresponding first cell signature, second cell signature and third cell signature.
步骤603,对所述第一细胞特征、第二细胞特征和第三细胞特征进行并行连接,得到每张训练细胞灰度图对应的特征融合结果;其中,所述第一细胞特征、第二细胞特征和第三细胞特征具有不同的抽象表达层次。Step 603: Connect the first cell feature, the second cell feature and the third cell feature in parallel to obtain a feature fusion result corresponding to each training cell grayscale image; wherein, the first cell feature, the second cell feature Features and tertiary cell features have different levels of abstract expression.
在得到第一细胞特征、第二细胞特征和第三细胞特征后,可以将第一细胞特征、第二细胞特征和第三细胞特征并行连接,得到每张训练细胞灰度图对应的特征融合结果,其中,第一细胞特征、第二细胞特征和第三细胞特征可以具有不同的抽象表达层次。After obtaining the first cell feature, the second cell feature and the third cell feature, the first cell feature, the second cell feature and the third cell feature can be connected in parallel to obtain the feature fusion result corresponding to each training cell grayscale image , wherein the first cell feature, the second cell feature, and the third cell feature can have different levels of abstract expression.
在具体实现中,第一网络结构可以是由10层卷积层组成的特征提取网络,第二网络结构可以是由11层卷积层和平均池化层组成,第三网络结构可以2层卷积层和最大池化层共同组成,第四网络结构可以是在第三网络结构后增加平均池化层组成的网络,即第四网络结构可以由2层卷积层、最大池化层和平均池化层组成。In a specific implementation, the first network structure may be a feature extraction network composed of 10 convolutional layers, the second network structure may be composed of 11 convolutional layers and an average pooling layer, and the third network structure may be a 2-layer convolutional layer. The accumulation layer and the maximum pooling layer are composed together, and the fourth network structure can be a network composed of an average pooling layer added after the third network structure, that is, the fourth network structure can be composed of 2 layers of convolutional layers, maximum pooling layers and average pooling layers. Composition of pooling layers.
在卷积神经网络模型中,浅层网络可以提取训练细胞灰度图中简单的特征,例如针对细胞形态、颜色、纹理和细胞边缘的特征提取,其反映的可以是细胞某一维度的具体特征,而深层网络提取的特征可以对浅层网络提取的特征进行抽象,获取可以反映细胞整体的细胞特征。基于此,在第一网络结构提取具体的训练细胞特征后,可以将训练细胞特征进一步输入至第二网络结构、第三网络结构和第四网络结构,通过不同层次的网络,得到具有不同抽象表达层次的细胞特征。In the convolutional neural network model, the shallow network can extract simple features in the grayscale image of the training cells, such as feature extraction for cell shape, color, texture and cell edge, which can reflect the specific features of a certain dimension of the cell , and the features extracted by the deep network can abstract the features extracted by the shallow network to obtain cell features that can reflect the overall cell. Based on this, after the specific training cell features are extracted from the first network structure, the training cell features can be further input into the second network structure, the third network structure and the fourth network structure, and through different levels of networks, different abstract expressions can be obtained. Hierarchical cellular characteristics.
在实际应用中,第一细胞特征、第二细胞特征和第三细胞特征可以通过矩阵形式输出,在得到第一细胞特征、第二细胞特征和第三细胞特征对应的矩阵后,每个矩阵可以在乘以不同的权值后,相加求和,该结果为特征融合结果,其中,矩阵的权值与网络结构提取的细胞特征占比呈正相关,即权重越大,网络结构提取的细胞特征占比越高。In practical applications, the first cell feature, the second cell feature, and the third cell feature can be output in the form of matrices. After the matrices corresponding to the first cell feature, the second cell feature, and the third cell feature are obtained, each matrix can be After multiplying by different weights, add and sum up, the result is the feature fusion result, in which the weight of the matrix is positively correlated with the proportion of cell features extracted by the network structure, that is, the larger the weight, the more cell features extracted by the network structure. higher proportion.
步骤604,将多张训练细胞灰度图分别对应的特征融合结果输入至所述全连接层,根据所述全连接层的输出结果确定多张训练细胞灰度图分别对应的蛋白表达量。Step 604: Input the feature fusion results corresponding to the grayscale images of the training cells to the fully connected layer, and determine the protein expression levels corresponding to the grayscale images of the training cells according to the output results of the fully connected layer.
在确定特征融合结果后,可以将多张训练细胞灰度图分别对应的特征融合结果输入至全连接层,并根据全连接层的输出结果,确定多张训练细胞灰度图分别对应的蛋白表达量。After the feature fusion results are determined, the feature fusion results corresponding to the grayscale images of the training cells can be input to the fully connected layer, and the protein expression corresponding to the grayscale images of the training cells can be determined according to the output results of the fully connected layer. quantity.
为了使本领域技术人员能够更好地理解上述步骤,以下通过一个例子对本申请实施例加以示例性说明,但应当理解的是,本申请实施例并不限于此。In order to enable those skilled in the art to better understand the above steps, an example is used below to illustrate the embodiment of the present application, but it should be understood that the embodiment of the present application is not limited thereto.
针对多张训练细胞灰度图,可以定义输入参数为B*3*448*448,B为每次训练卷积神经网络模型时输入网络的训练细胞灰度图数量,3表示图片通道数为R、G、B三个通道,448为图片的宽度和高度。For multiple grayscale images of training cells, the input parameters can be defined as B*3*448*448, where B is the number of grayscale images of training cells input to the network each time the convolutional neural network model is trained, and 3 means the number of image channels is R , G, B three channels, 448 is the width and height of the picture.
在将多张训练细胞灰度图输入第一网络结构并得到每张训练细胞灰度图对应的训练细胞特征后,针对每个训练细胞特征,可以分别输入至第二网络结构、第三网络结构和第四网络结构,第二网络结构、第三网络结构和第四网络结构可以分别输出大小为B*100的矩阵,即第一细胞特征、第二细胞特征和第三细胞特征,其中,矩阵大小可以在训练过程中调整,即数值100可以根据实际需要进行调整。After inputting multiple training cell grayscale images into the first network structure and obtaining the training cell features corresponding to each training cell grayscale image, each training cell feature can be input to the second network structure and the third network structure respectively. and the fourth network structure, the second network structure, the third network structure and the fourth network structure can respectively output a matrix of size B*100, that is, the first cell feature, the second cell feature and the third cell feature, where the matrix The size can be adjusted during training, that is, the value of 100 can be adjusted according to actual needs.
在得到3个大小为B*100的矩阵,每个矩阵可以分别乘以不同权值,并相加求和,得到一个B*100的矩阵,即特征融合结果,其中,权值可以是在0至1区间内变动的数。After obtaining 3 matrices of size B*100, each matrix can be multiplied by different weights and added and summed to obtain a matrix of B*100, that is, the result of feature fusion, where the weights can be 0 The number of changes in the interval to 1.
在获取特征融合结果后,可以将该结果输入至全连接层,全连接层的输入数量与矩阵的大小对应,输出数量为1,在本示例中,特征融合结果可以经过一个输入为100,输出为1的全连接层,得到输出形式为B*1的向量,该向量中的每个分量与一张训练细胞灰度图对应,分量的数值为第二蛋白表达量。After the feature fusion result is obtained, the result can be input to the fully connected layer. The number of inputs of the fully connected layer corresponds to the size of the matrix, and the number of outputs is 1. In this example, the result of feature fusion can pass through an input of 100 and output The fully-connected layer with 1 obtains a vector whose output is in the form of B*1. Each component in the vector corresponds to a grayscale image of the training cells, and the value of the component is the expression level of the second protein.
在一个实施例中,所述根据多个待测细胞对应的蛋白表达量,从所述多个待测细胞中确定出蛋白表达量满足设定条件的目标细胞,可以包括如下步骤:In one embodiment, according to the protein expression levels corresponding to the plurality of cells to be tested, determining from the plurality of cells to be tested the target cells whose protein expression meets the set condition may include the following steps:
对多个待测细胞对应的蛋白表达量进行排序,并从排序后的多个待测细胞对应的蛋白表达量中,将排序最前的预设数量的蛋白表达量确定为目标表达量;确定所述目标表达量对应的待测细胞灰度图,并将所述待测细胞灰度图对应的待测细胞确定为目标细胞。Sort the protein expression levels corresponding to the plurality of cells to be tested, and from the protein expression levels corresponding to the plurality of cells to be tested after sorting, determine the protein expression level of the first preset number as the target expression level; The grayscale map of the cells to be tested corresponding to the target expression level is determined, and the cells to be tested corresponding to the grayscale map of the cells to be tested are determined as target cells.
在具体实现中,在得到多个待测细胞分别对应的蛋白表达量后,可以对多个待测细胞对应的蛋白表达量进行排序,并从排序后的蛋白表达量中,将排序最前的预设数量的蛋白表达量确定为目标表达量。In a specific implementation, after obtaining the protein expression levels corresponding to the plurality of cells to be tested, the protein expression levels corresponding to the plurality of cells to be tested can be sorted, and from the sorted protein expression levels, the most advanced prediction The amount of protein expression was determined as the target expression level.
具体的,可以对多个待测细胞对应的蛋白表达量进行降序排列,即由大到小进行排序,在排序后,可以将前N名对应的蛋白表达量,确定为目标表达量。当然,在实际应用中,还可以将超过预设表达量阈值的蛋白表达量确定为目标表达量。Specifically, the protein expression levels corresponding to the plurality of cells to be tested can be sorted in descending order, that is, sorted from large to small. After sorting, the protein expression levels corresponding to the top N names can be determined as the target expression levels. Of course, in practical applications, the protein expression level exceeding the preset expression level threshold can also be determined as the target expression level.
在确定目标表达量后,可以确定目标表达量对应的待测细胞灰度图,并将待测细胞灰度图对应的待测细胞确定为目标细胞。该目标细胞可用于培养细胞株。After the target expression level is determined, the grayscale image of the cell to be tested corresponding to the target expression level can be determined, and the cell to be tested corresponding to the grayscale image of the cell to be tested is determined as the target cell. The target cells can be used to culture cell lines.
在本实施例中,对多个待测细胞对应的蛋白表达量进行排序,并根据排序后的多个待测细胞对应的蛋白表达量,从多个待测细胞中,将蛋白表达量排序最前的预设数量的细胞确定为目标细胞,能够快速筛选具有高蛋白表达量的细胞,大大减少了筛选工作量。In this embodiment, the protein expression levels corresponding to the plurality of cells to be tested are sorted, and according to the sorted protein expression levels corresponding to the plurality of cells to be tested, the protein expression levels of the plurality of cells to be tested are ranked first. The preset number of cells are determined as target cells, which can quickly screen cells with high protein expression, which greatly reduces the screening workload.
在一个实施例中,所述获取训练细胞灰度图,可以包括如下步骤:In one embodiment, the acquiring a grayscale image of the training cells may include the following steps:
获取用于模型训练的原始细胞灰度图,并对所述原始细胞灰度图进行归一化处理;对处理后的原始细胞灰度图进行数据增强处理,得到训练细胞灰度图;所述数据增强处理包括以下任一项或多项:旋转处理、翻转处理、对比度增强处理、随机剪裁处理。Obtain the original cell grayscale image used for model training, and perform normalization processing on the original cell grayscale image; perform data enhancement processing on the processed original cell grayscale image to obtain the training cell grayscale image; the Data enhancement processing includes any one or more of the following: rotation processing, inversion processing, contrast enhancement processing, and random cropping processing.
在具体实现中,可以获取用于模型训练的原始细胞灰度图,并对原始细胞灰度图进行归一化处理,其中,原始细胞灰度图可以是使用显微镜对作为训练集的细胞拍摄的灰度图。In a specific implementation, the original cell grayscale image used for model training can be obtained, and the original cell grayscale image can be normalized, wherein the original cell grayscale image can be photographed using a microscope on the cells used as the training set Grayscale image.
在进行归一化处理后,可以对处理后的原始细胞灰度图进行数据增强处理,例如对图像进行旋转、翻转、随机剪裁,或者增强图像的对比度。After normalization processing, data enhancement processing can be performed on the processed raw grayscale image, such as rotating, flipping, randomly cropping the image, or enhancing the contrast of the image.
在本实施例中,通过对处理后的原始细胞灰度图进行数据增强处理,得到训练细胞灰度图,可以增加用于训练卷积神经网络模型的训练细胞灰度图,在训练样本不足的情况下,快速扩大训练样本,为卷积神经网络模型的训练提供数据支撑。In this embodiment, by performing data enhancement processing on the processed original cell grayscale image, the training cell grayscale image is obtained, and the training cell grayscale image used for training the convolutional neural network model can be added. In this case, the training samples are rapidly expanded to provide data support for the training of the convolutional neural network model.
应该理解的是,虽然图1、2、4-6的流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明以指明这些步骤的执行有严格的顺序限制,否则,这些步骤可以以其它的顺序执行。而且,图1、2、4-6中的至少一部分步骤可以包括多个步骤或者多个阶段,这些步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,这些步骤或者阶段的执行顺序也不必然是依次进行,而是可以与其它步骤或者其它步骤中的步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that although the various steps in the flowcharts of FIGS. 1 , 2 , and 4-6 are displayed in sequence according to the arrows, these steps are not necessarily executed in the sequence indicated by the arrows. The steps may be performed in other orders unless explicitly stated herein to indicate a strict order restriction on the performance of the steps. Moreover, at least a part of the steps in FIGS. 1, 2, and 4-6 may include multiple steps or multiple stages. These steps or stages are not necessarily executed at the same time, but may be executed at different times. These steps Alternatively, the order of execution of the stages is not necessarily sequential, but may be performed alternately or alternately with other steps or at least a portion of the steps or stages in the other steps.
在一个实施例中,如图7所示,提供了一种基于卷积神经网络的细胞筛选装置,可以包括:In one embodiment, as shown in FIG. 7, a cell screening device based on a convolutional neural network is provided, which may include:
待测细胞灰度图获取模块701,用于获取细胞培养池中多个待测细胞分别对应的待测细 胞灰度图;The grayscale image acquisition module 701 of the cells to be tested is used to obtain grayscale images of the cells to be tested corresponding to a plurality of cells to be tested in the cell culture tank;
第一输入模块702,用于将多个待测细胞对应的多张待测细胞灰度图输入目标卷积神经网络模型;所述目标卷积神经网络模型根据具有表达量标签的多张训练细胞灰度图训练得到,所述表达量标签用于表征各训练细胞灰度图中细胞的真实蛋白表达量,所述目标卷积神经网络模型用于检测输入目标卷积神经网络模型的待测细胞灰度图中细胞的蛋白表达量;The first input module 702 is used to input multiple grayscale images of the cells to be tested corresponding to the multiple cells to be tested into the target convolutional neural network model; the target convolutional neural network model is based on multiple training cells with expression labels. The grayscale image training is obtained, the expression label is used to represent the real protein expression of the cells in the grayscale image of each training cell, and the target convolutional neural network model is used to detect the cells to be tested that are input to the target convolutional neural network model. The protein expression of cells in the grayscale image;
蛋白表达量预测模块703,用于根据所述目标卷积神经网络模型的输出,得到所述多个待测细胞分别对应的蛋白表达量;A protein expression level prediction module 703, configured to obtain the respective protein expression levels corresponding to the multiple cells to be tested according to the output of the target convolutional neural network model;
细胞筛选模块704,用于根据多个待测细胞对应的蛋白表达量,从所述多个待测细胞中确定出蛋白表达量满足设定条件的目标细胞。The cell screening module 704 is configured to determine, according to the protein expression levels corresponding to the plurality of cells to be tested, target cells whose protein expression meets the set condition from the plurality of cells to be tested.
在一个实施例中,基于卷积神经网络的细胞筛选装置可以还包括:In one embodiment, the cell screening device based on convolutional neural network may further include:
训练细胞灰度图获取模块,用于获取训练细胞灰度图及其对应的训练细胞荧光图;The training cell grayscale image acquisition module is used to obtain the training cell grayscale image and its corresponding training cell fluorescence image;
表达量标签确定模块,用于根据所述训练细胞荧光图,确定对应的训练细胞灰度图中细胞的真实蛋白表达量,基于所述真实蛋白表达量得到所述训练细胞灰度图对应的表达量标签;The expression label determination module is used to determine the real protein expression of the cells in the corresponding training cell grayscale image according to the training cell fluorescence map, and obtain the expression corresponding to the training cell grayscale image based on the real protein expression quantity label;
训练模块,用于采用所述训练细胞灰度图和所述表达量标签训练卷积神经网络模型,生成目标卷积神经网络模型。A training module is used to train a convolutional neural network model by using the training cell grayscale map and the expression label to generate a target convolutional neural network model.
在一个实施例中,所述表达量标签确定模块,包括:In one embodiment, the expression quantity label determination module includes:
绿色亮度值确定子模块,用于确定所述训练细胞荧光图中绿色通道的数值;The green brightness value determination submodule is used to determine the value of the green channel in the fluorescence image of the training cells;
真实蛋白表达量确定子模块,用于根据所述训练细胞荧光图中绿色通道的数值,确定对应的训练细胞灰度图中细胞的真实蛋白表达量;The real protein expression determination submodule is used to determine the real protein expression of the cells in the corresponding training cell grayscale image according to the value of the green channel in the training cell fluorescence image;
表达量标签生成子模块,用于将所述真实蛋白表达量确定为所述训练细胞灰度图对应的表达量标签。The expression level label generation sub-module is used for determining the real protein expression level as the expression level label corresponding to the grayscale image of the training cells.
在一个实施例中,所述训练模块,包括:In one embodiment, the training module includes:
蛋白表达量确定子模块,用于将所述训练细胞灰度图输入所述卷积神经网络模型,确定所述训练细胞灰度图对应的蛋白表达量;A protein expression level determination submodule, configured to input the grayscale image of the training cells into the convolutional neural network model, and determine the protein expression level corresponding to the grayscale image of the training cells;
训练误差确定子模块,用于根据所述训练细胞灰度图对应的蛋白表达量和所述表达量标签,确定训练误差;a training error determination submodule, used for determining the training error according to the protein expression corresponding to the grayscale image of the training cells and the expression label;
参数调整子模块,用于根据所述训练误差,通过减小误差对所述卷积神经网络模型的网络参数进行调整,以得到最优网络参数,并采用所述最优网络参数生成目标卷积神经网络模型。A parameter adjustment sub-module, configured to adjust the network parameters of the convolutional neural network model by reducing the error according to the training error to obtain optimal network parameters, and use the optimal network parameters to generate a target convolution Neural network model.
在一个实施例中,所述卷积神经网络模型包括多层结构,所述参数调整子模块,包括:In one embodiment, the convolutional neural network model includes a multi-layer structure, and the parameter adjustment sub-module includes:
判断单元,用于判断所述训练误差是否收敛且小于预设误差阈值;若是,调用参数确定单元;若否,调用反向传播单元;a judgment unit for judging whether the training error has converged and is less than a preset error threshold; if so, call the parameter determination unit; if not, call the backpropagation unit;
参数确定单元,用于确定当前的卷积神经网络模型的网络参数为最优网络参数;a parameter determination unit, used for determining the network parameters of the current convolutional neural network model as the optimal network parameters;
反向传播单元,用于采用所述训练误差从所述卷积神经网络模型的最后一层反向传播,通过减小误差对所述卷积神经网络模型各层的网络参数进行调整,并返回将所述训练细胞灰度图输入所述卷积神经网络模型,确定所述训练细胞灰度图对应的蛋白表达量的步骤。A back-propagation unit, configured to use the training error to back-propagate from the last layer of the convolutional neural network model, adjust the network parameters of each layer of the convolutional neural network model by reducing the error, and return The step of inputting the training cell grayscale image into the convolutional neural network model, and determining the protein expression level corresponding to the training cell grayscale image.
在一个实施例中,所述卷积神经网络模型包括第一网络结构、第二网络结构、第三网络结构、第四网络结构和全连接层;所述训练细胞灰度图为多张训练细胞灰度图;In one embodiment, the convolutional neural network model includes a first network structure, a second network structure, a third network structure, a fourth network structure and a fully connected layer; the training cell grayscale image is a plurality of training cells grayscale image;
所述蛋白表达量确定子模块,包括:The protein expression level determination submodule includes:
第二输入单元,用于将多张训练细胞灰度图输入所述卷积神经网络模型;The second input unit is used for inputting a plurality of training cell grayscale images into the convolutional neural network model;
训练细胞特征获取单元,用于针对每张训练细胞灰度图,通过所述第一网络结构获取对应的训练细胞特征;将所述训练细胞特征输入所述第二网络结构、第三网络结构和第四网络结构,得到对应的第一细胞特征、第二细胞特征和第三细胞特征;The training cell feature acquisition unit is used to obtain the corresponding training cell feature through the first network structure for each training cell grayscale image; input the training cell feature into the second network structure, the third network structure and The fourth network structure obtains the corresponding first cell feature, second cell feature and third cell feature;
特征融合结果获取单元,用于对所述第一细胞特征、第二细胞特征和第三细胞特征进行并行连接,得到每张训练细胞灰度图对应的特征融合结果;其中,所述第一细胞特征、第二细胞特征和第二细胞特征具有不同的抽象表达层次;A feature fusion result acquisition unit, configured to connect the first cell feature, the second cell feature and the third cell feature in parallel to obtain a feature fusion result corresponding to each training cell grayscale image; wherein, the first cell feature The feature, the second cell feature, and the second cell feature have different levels of abstract expression;
结果输出单元,用于将多张训练细胞灰度图分别对应的特征融合结果输入至所述全连接层,根据所述全连接层的输出结果确定多张训练细胞灰度图分别对应的第二蛋白表达量。The result output unit is used to input the feature fusion results corresponding to the grayscale images of the training cells to the fully connected layer, and determine the second corresponding grayscale images of the training cells according to the output results of the fully connected layer. protein expression.
在一个实施例中,所述细胞筛选模块704,包括:In one embodiment, the cell screening module 704 includes:
排序子模块,用于对多个待测细胞对应的蛋白表达量进行排序,并从排序后的多个待测细胞对应的蛋白表达量中,将排序最前的预设数量的蛋白表达量确定为目标表达量;The sorting submodule is used to sort the protein expression levels corresponding to the multiple cells to be tested, and from the protein expression levels corresponding to the sorted multiple cells to be tested, determine the protein expression level of the first preset number as target expression level;
目标细胞确定子模块,用于确定所述目标表达量对应的待测细胞灰度图,并将所述待测细胞灰度图对应的待测细胞确定为目标细胞。The target cell determination submodule is used to determine the grayscale image of the cell to be tested corresponding to the target expression level, and to determine the cell to be tested corresponding to the grayscale image of the cell to be tested as the target cell.
在一个实施例中,所述训练细胞灰度图获取模块,包括:In one embodiment, the training cell grayscale image acquisition module includes:
原始细胞灰度图获取子模块,用于获取用于模型训练的原始细胞灰度图,并对所述原始细胞灰度图进行归一化处理;The original cell grayscale image acquisition sub-module is used to obtain the original cell grayscale image used for model training, and normalize the original cell grayscale image;
数据增强处理子模块,用于对处理后的原始细胞灰度图进行数据增强处理,得到训练细胞灰度图;所述数据增强处理包括以下任一项或多项:旋转处理、翻转处理、对比度增强处理、随机剪裁处理。The data enhancement processing submodule is used to perform data enhancement processing on the processed original cell grayscale image to obtain the training cell grayscale image; the data enhancement processing includes any one or more of the following: rotation processing, flip processing, contrast Enhanced processing, random cropping processing.
关于一种基于卷积神经网络的细胞筛选装置的具体限定可以参见上文中对于一种基于卷积神经网络的细胞筛选方法的限定,在此不再赘述。上述一种基于卷积神经网络的细胞筛选装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中,也可以以软件形式存储于计算机设备中的存储器中,以便于处理器调用执行以上各个模块对应的操作。For a specific definition of a cell screening device based on a convolutional neural network, reference may be made to the above definition of a cell screening method based on a convolutional neural network, which will not be repeated here. Each module in the above-mentioned cell screening device based on convolutional neural network can be implemented in whole or in part by software, hardware and combinations thereof. The above modules can be embedded in or independent of the processor in the computer device in the form of hardware, or stored in the memory in the computer device in the form of software, so that the processor can call and execute the operations corresponding to the above modules.
在一个实施例中,提供了一种计算机设备,该计算机设备可以是终端,其内部结构图可以如图8所示。该计算机设备包括通过系统总线连接的处理器、存储器、通信接口、显示屏和输入装置。其中,该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统和计算机程序。该内存储器为非易失性存储介质中的操作系统和计算机程序的运行提供环境。该计算机设备的通信接口用于与外部的终端进行有线或无线方式的通信,无线方式可通过WIFI、运营商网络、NFC(近场通信)或其他技术实现。该计算机程序被处理器执行时以实现基于卷积神经网络的细胞筛选方法。该计算机设备的显示屏可以是液晶显示屏或者电子墨水显示屏,该计 算机设备的输入装置可以是显示屏上覆盖的触摸层,也可以是计算机设备外壳上设置的按键、轨迹球或触控板,还可以是外接的键盘、触控板或鼠标等。In one embodiment, a computer device is provided, and the computer device may be a terminal, and its internal structure diagram may be as shown in FIG. 8 . The computer equipment includes a processor, memory, a communication interface, a display screen, and an input device connected by a system bus. Among them, the processor of the computer device is used to provide computing and control capabilities. The memory of the computer device includes a non-volatile storage medium, an internal memory. The nonvolatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the execution of the operating system and computer programs in the non-volatile storage medium. The communication interface of the computer device is used for wired or wireless communication with an external terminal, and the wireless communication can be realized by WIFI, operator network, NFC (Near Field Communication) or other technologies. The computer program, when executed by a processor, implements a convolutional neural network based cell screening method. The display screen of the computer equipment may be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer equipment may be a touch layer covered on the display screen, or a button, a trackball or a touchpad set on the shell of the computer equipment , or an external keyboard, trackpad, or mouse.
本领域技术人员可以理解,图8中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。Those skilled in the art can understand that the structure shown in FIG. 8 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer equipment to which the solution of the present application is applied. Include more or fewer components than shown in the figures, or combine certain components, or have a different arrangement of components.
在一个实施例中,提供了一种计算机设备,包括存储器和处理器,存储器中存储有计算机程序,该处理器执行计算机程序时实现以下步骤:In one embodiment, a computer device is provided, including a memory and a processor, a computer program is stored in the memory, and the processor implements the following steps when executing the computer program:
获取细胞培养池中多个待测细胞分别对应的待测细胞灰度图;Obtain the grayscale images of the cells to be tested corresponding to the plurality of cells to be tested in the cell culture pool;
将多个待测细胞对应的多张待测细胞灰度图输入目标卷积神经网络模型;所述目标卷积神经网络模型根据具有表达量标签的多张训练细胞灰度图训练得到,所述表达量标签用于表征各训练细胞灰度图中细胞的真实蛋白表达量,所述目标卷积神经网络模型用于检测输入模型的待测细胞灰度图中细胞的蛋白表达量;A plurality of grayscale images of cells to be tested corresponding to a plurality of cells to be tested are input into the target convolutional neural network model; the target convolutional neural network model is obtained by training a plurality of grayscale images of training cells with expression labels, and the said target convolutional neural network model is obtained by training The expression label is used to characterize the real protein expression of the cells in the grayscale image of each training cell, and the target convolutional neural network model is used to detect the protein expression of the cells in the grayscale image of the cells to be tested of the input model;
根据所述目标卷积神经网络模型的输出,得到所述多个待测细胞分别对应的蛋白表达量;According to the output of the target convolutional neural network model, obtain the corresponding protein expression levels of the plurality of cells to be tested;
根据多个待测细胞对应的蛋白表达量,从所述多个待测细胞中确定出蛋白表达量满足设定条件的目标细胞。According to the protein expression levels corresponding to the plurality of cells to be tested, target cells whose protein expression levels meet the set conditions are determined from the plurality of cells to be tested.
在一个实施例中,处理器执行计算机程序时还实现上述其他实施例中的步骤。In one embodiment, when the processor executes the computer program, it also implements the steps in the other embodiments described above.
在一个实施例中,提供了一种计算机可读存储介质,其上存储有计算机程序,计算机程序被处理器执行时实现以下步骤:In one embodiment, a computer-readable storage medium is provided on which a computer program is stored, and when the computer program is executed by a processor, the following steps are implemented:
获取细胞培养池中多个待测细胞分别对应的待测细胞灰度图;Obtain the grayscale images of the cells to be tested corresponding to the plurality of cells to be tested in the cell culture pool;
将多个待测细胞对应的多张待测细胞灰度图输入目标卷积神经网络模型;所述目标卷积神经网络模型根据具有表达量标签的多张训练细胞灰度图训练得到,所述表达量标签用于表征各训练细胞灰度图中细胞的真实蛋白表达量,所述目标卷积神经网络模型用于检测输入模型的待测细胞灰度图中细胞的蛋白表达量;A plurality of grayscale images of cells to be tested corresponding to a plurality of cells to be tested are input into the target convolutional neural network model; the target convolutional neural network model is obtained by training a plurality of grayscale images of training cells with expression labels, and the said target convolutional neural network model is obtained by training The expression label is used to characterize the real protein expression of the cells in the grayscale image of each training cell, and the target convolutional neural network model is used to detect the protein expression of the cells in the grayscale image of the cells to be tested of the input model;
根据所述目标卷积神经网络模型的输出,得到所述多个待测细胞分别对应的蛋白表达量;According to the output of the target convolutional neural network model, obtain the corresponding protein expression levels of the plurality of cells to be tested;
根据多个待测细胞对应的蛋白表达量,从所述多个待测细胞中确定出蛋白表达量满足设定条件的目标细胞。According to the protein expression levels corresponding to the plurality of cells to be tested, target cells whose protein expression levels meet the set conditions are determined from the plurality of cells to be tested.
在一个实施例中,计算机程序被处理器执行时还实现上述其他实施例中的步骤。In one embodiment, the computer program, when executed by the processor, also implements the steps in the other embodiments described above.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一非易失性计算机可读取存储介质中,该计算机程序在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和易失性存储器中的至少一种。非易失性存储器可包括只读存储器(Read-Only Memory,ROM)、磁带、软盘、闪存或光存储器等。易失性存储器可包括随机存取存储器(Random Access Memory,RAM)或外部高速缓冲存储器。作为说明而非局限,RAM可以是多种形式,比如静态随机存取存储器(Static Random Access Memory,SRAM)或动态随机存取存储器(Dynamic  Random Access Memory,DRAM)等。Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented by instructing relevant hardware through a computer program, and the computer program can be stored in a non-volatile computer-readable storage In the medium, when the computer program is executed, it may include the processes of the above-mentioned method embodiments. Wherein, any reference to memory, storage, database or other media used in the various embodiments provided in this application may include at least one of non-volatile and volatile memory. Non-volatile memory may include read-only memory (Read-Only Memory, ROM), magnetic tape, floppy disk, flash memory, or optical memory, and the like. Volatile memory may include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM may be in various forms, such as Static Random Access Memory (SRAM) or Dynamic Random Access Memory (DRAM), and the like.
以上实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above embodiments can be combined arbitrarily. In order to make the description simple, all possible combinations of the technical features in the above embodiments are not described. However, as long as there is no contradiction in the combination of these technical features It is considered to be the range described in this specification.
以上所述实施例仅表达了本申请的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本申请构思的前提下,还可以做出若干变形和改进,这些都属于本申请的保护范围。因此,本申请专利的保护范围应以所附权利要求为准。The above-mentioned embodiments only represent several embodiments of the present application, and the descriptions thereof are specific and detailed, but should not be construed as a limitation on the scope of the invention patent. It should be pointed out that for those skilled in the art, without departing from the concept of the present application, several modifications and improvements can be made, which all belong to the protection scope of the present application. Therefore, the scope of protection of the patent of the present application shall be subject to the appended claims.

Claims (10)

  1. 一种基于卷积神经网络的细胞筛选方法,其特征在于,所述方法包括:A cell screening method based on convolutional neural network, characterized in that the method comprises:
    获取细胞培养池中多个待测细胞分别对应的待测细胞灰度图;Obtain the grayscale images of the cells to be tested corresponding to the plurality of cells to be tested in the cell culture pool;
    将多个待测细胞对应的多张待测细胞灰度图输入目标卷积神经网络模型;所述目标卷积神经网络模型根据具有表达量标签的多张训练细胞灰度图训练得到,所述表达量标签用于表征各训练细胞灰度图中细胞的真实蛋白表达量,所述目标卷积神经网络模型用于检测输入模型的待测细胞灰度图中细胞的蛋白表达量;A plurality of grayscale images of cells to be tested corresponding to a plurality of cells to be tested are input into the target convolutional neural network model; the target convolutional neural network model is obtained by training a plurality of grayscale images of training cells with expression labels, and the said target convolutional neural network model is obtained by training The expression label is used to characterize the real protein expression of the cells in the grayscale image of each training cell, and the target convolutional neural network model is used to detect the protein expression of the cells in the grayscale image of the cells to be tested of the input model;
    根据所述目标卷积神经网络模型的输出,得到所述多个待测细胞分别对应的蛋白表达量;According to the output of the target convolutional neural network model, obtain the corresponding protein expression levels of the plurality of cells to be tested;
    根据多个待测细胞对应的蛋白表达量,从所述多个待测细胞中确定出蛋白表达量满足设定条件的目标细胞。According to the protein expression levels corresponding to the plurality of cells to be tested, target cells whose protein expression levels meet the set conditions are determined from the plurality of cells to be tested.
  2. 根据权利要求1所述的方法,其特征在于,还包括:The method of claim 1, further comprising:
    获取训练细胞灰度图及其对应的训练细胞荧光图;Obtain the training cell grayscale image and its corresponding training cell fluorescence image;
    根据所述训练细胞荧光图,确定对应的训练细胞灰度图中细胞的真实蛋白表达量,基于所述真实蛋白表达量得到所述训练细胞灰度图对应的表达量标签;According to the training cell fluorescence map, the actual protein expression level of the cells in the corresponding training cell grayscale map is determined, and the expression level label corresponding to the training cell grayscale map is obtained based on the actual protein expression level;
    采用所述训练细胞灰度图和所述表达量标签训练卷积神经网络模型,生成目标卷积神经网络模型。The convolutional neural network model is trained by using the training cell grayscale image and the expression label to generate a target convolutional neural network model.
  3. 根据权利要求2所述的方法,其特征在于,所述根据所述训练细胞荧光图,确定对应的训练细胞灰度图中细胞的真实蛋白表达量,基于所述真实蛋白表达量得到所述训练细胞灰度图对应的表达量标签,包括:The method according to claim 2, wherein, according to the training cell fluorescence map, the actual protein expression of the cells in the corresponding training cell grayscale image is determined, and the training is obtained based on the actual protein expression. Expression labels corresponding to cell grayscale images, including:
    确定所述训练细胞荧光图中绿色通道的数值;determining the value of the green channel in the training cell fluorescence image;
    根据所述训练细胞荧光图中绿色通道的数值,确定对应的训练细胞灰度图中细胞的真实蛋白表达量;According to the value of the green channel in the fluorescence image of the training cells, determine the actual protein expression of the cells in the corresponding gray-scale image of the training cells;
    将所述真实蛋白表达量确定为所述训练细胞灰度图对应的表达量标签。The real protein expression level is determined as the expression level label corresponding to the grayscale image of the training cells.
  4. 根据权利要求2所述的方法,其特征在于,所述采用所述训练细胞灰度图和所述表达量标签训练卷积神经网络模型,生成目标卷积神经网络模型,包括:The method according to claim 2, wherein the training a convolutional neural network model by using the training cell grayscale image and the expression label to generate a target convolutional neural network model, comprising:
    将所述训练细胞灰度图输入所述卷积神经网络模型,确定所述训练细胞灰度图对应的蛋白表达量;Inputting the training cell grayscale image into the convolutional neural network model, and determining the protein expression corresponding to the training cell grayscale image;
    根据所述训练细胞灰度图对应的蛋白表达量和所述表达量标签,确定训练误差;Determine the training error according to the protein expression corresponding to the grayscale image of the training cells and the expression label;
    根据所述训练误差,通过减小误差对所述卷积神经网络模型的网络参数进行调整,以得到最优网络参数,并采用所述最优网络参数生成目标卷积神经网络模型。According to the training error, the network parameters of the convolutional neural network model are adjusted by reducing the error to obtain optimal network parameters, and the target convolutional neural network model is generated by using the optimal network parameters.
  5. 根据权利要求4所述的方法,其特征在于,所述卷积神经网络模型包括多层结构,所述根据所述训练误差,通过减小误差对所述卷积神经网络模型的网络参数进行调整,以得到最优网络参数,包括:The method according to claim 4, wherein the convolutional neural network model comprises a multi-layer structure, and the network parameters of the convolutional neural network model are adjusted by reducing the error according to the training error , to get the optimal network parameters, including:
    判断所述训练误差是否收敛且小于预设误差阈值;judging whether the training error is converged and less than a preset error threshold;
    若是,确定当前的卷积神经网络模型的网络参数为最优网络参数;If so, determine the network parameters of the current convolutional neural network model as the optimal network parameters;
    若否,采用所述训练误差从所述卷积神经网络模型的最后一层反向传播,通过减小误差 对所述卷积神经网络模型各层的网络参数进行调整,并返回将所述训练细胞灰度图输入所述卷积神经网络模型,确定所述训练细胞灰度图对应的蛋白表达量的步骤。If not, use the training error to backpropagate from the last layer of the convolutional neural network model, adjust the network parameters of each layer of the convolutional neural network model by reducing the error, and return the training The cell grayscale image is input into the convolutional neural network model, and the step of determining the protein expression level corresponding to the training cell grayscale image.
  6. 根据权利要求4所述的方法,其特征在于,所述卷积神经网络模型包括第一网络结构、第二网络结构、第三网络结构、第四网络结构和全连接层;所述训练细胞灰度图为多张训练细胞灰度图;The method according to claim 4, wherein the convolutional neural network model comprises a first network structure, a second network structure, a third network structure, a fourth network structure and a fully connected layer; the training cell gray The degree map is a grayscale image of multiple training cells;
    所述将所述训练细胞灰度图输入所述卷积神经网络模型,确定所述训练细胞灰度图对应的蛋白表达量,包括:Inputting the training cell grayscale image into the convolutional neural network model, and determining the protein expression corresponding to the training cell grayscale image, including:
    将多张训练细胞灰度图输入所述卷积神经网络模型;inputting multiple grayscale images of training cells into the convolutional neural network model;
    针对每张训练细胞灰度图,通过所述第一网络结构获取对应的训练细胞特征;将所述训练细胞特征输入所述第二网络结构、第三网络结构和第四网络结构,得到对应的第一细胞特征、第二细胞特征和第三细胞特征;For each training cell grayscale image, obtain the corresponding training cell feature through the first network structure; input the training cell feature into the second network structure, the third network structure and the fourth network structure to obtain the corresponding a first cell feature, a second cell feature, and a third cell feature;
    对所述第一细胞特征、第二细胞特征和第三细胞特征进行并行连接,得到每张训练细胞灰度图对应的特征融合结果;其中,所述第一细胞特征、第二细胞特征和第三细胞特征具有不同的抽象表达层次;The first cell feature, the second cell feature and the third cell feature are connected in parallel to obtain the feature fusion result corresponding to each training cell grayscale image; wherein the first cell feature, the second cell feature and the third cell feature are Three-cell features have different levels of abstract expression;
    将多张训练细胞灰度图分别对应的特征融合结果输入至所述全连接层,根据所述全连接层的输出结果确定多张训练细胞灰度图分别对应的蛋白表达量。The feature fusion results corresponding to the multiple grayscale images of the training cells are input into the fully connected layer, and the protein expression levels corresponding to the multiple grayscale images of the training cells are determined according to the output results of the fully connected layer.
  7. 根据权利要求1所述的方法,其特征在于,所述根据多个待测细胞对应的蛋白表达量,从所述多个待测细胞中确定出蛋白表达量满足设定条件的目标细胞,包括:The method according to claim 1, wherein, according to the protein expression levels corresponding to the plurality of cells to be tested, the target cells whose protein expression meets the set condition are determined from the plurality of cells to be tested, comprising: :
    对多个待测细胞对应的蛋白表达量进行排序,并从排序后的多个待测细胞对应的蛋白表达量中,将排序最前的预设数量的蛋白表达量确定为目标表达量;Sorting the protein expression levels corresponding to the plurality of cells to be tested, and from the protein expression levels corresponding to the plurality of cells to be tested after sorting, determining the protein expression level of the first preset number of cells as the target expression level;
    确定所述目标表达量对应的待测细胞灰度图,并将所述待测细胞灰度图对应的待测细胞确定为目标细胞。The grayscale image of the cell to be tested corresponding to the target expression level is determined, and the cell to be tested corresponding to the grayscale image of the cell to be tested is determined as the target cell.
  8. 一种基于卷积神经网络的细胞筛选装置,其特征在于,所述装置包括:A cell screening device based on convolutional neural network, characterized in that the device comprises:
    待测细胞灰度图获取模块,用于获取细胞培养池中多个待测细胞分别对应的待测细胞灰度图;The grayscale image acquisition module of the cells to be tested is used to acquire the grayscale images of the cells to be tested corresponding to a plurality of cells to be tested in the cell culture tank;
    第一输入模块,用于将多个待测细胞对应的多张待测细胞灰度图输入目标卷积神经网络模型;所述目标卷积神经网络模型根据具有表达量标签的多张训练细胞灰度图训练得到,所述表达量标签用于表征各训练细胞灰度图中细胞的真实蛋白表达量,所述目标卷积神经网络模型用于检测输入模型的待测细胞灰度图中细胞的蛋白表达量;The first input module is used to input multiple grayscale images of the cells to be tested corresponding to the multiple cells to be tested into the target convolutional neural network model; the target convolutional neural network model The degree map training is obtained, the expression label is used to represent the actual protein expression of the cells in the grayscale map of each training cell, and the target convolutional neural network model is used to detect the input model. protein expression;
    蛋白表达量预测模块,用于根据所述目标卷积神经网络模型的输出,得到所述多个待测细胞分别对应的蛋白表达量;A protein expression level prediction module, configured to obtain the respective protein expression levels corresponding to the plurality of cells to be tested according to the output of the target convolutional neural network model;
    细胞筛选模块,用于根据多个待测细胞对应的蛋白表达量,从所述多个待测细胞中确定出蛋白表达量满足设定条件的目标细胞。The cell screening module is used to determine, from the plurality of cells to be tested, the target cells whose protein expression meets the set condition according to the corresponding protein expression of the plurality of cells to be tested.
  9. 一种计算机设备,包括存储器和处理器,所述存储器存储有计算机程序,其特征在于,所述处理器执行所述计算机程序时实现权利要求1至7中任一项所述的基于卷积神经网络的细胞筛选方法的步骤。A computer device, comprising a memory and a processor, wherein the memory stores a computer program, characterized in that, when the processor executes the computer program, the convolutional neural-based convolutional neural network described in any one of claims 1 to 7 is implemented Steps of a cellular screening method for networks.
  10. 一种计算机可读存储介质,其上存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现权利要求1至7中任一项所述的基于卷积神经网络的细胞筛选方法的步骤。A computer-readable storage medium on which a computer program is stored, characterized in that, when the computer program is executed by a processor, the convolutional neural network-based cell screening method according to any one of claims 1 to 7 is realized A step of.
PCT/CN2021/114165 2020-08-26 2021-08-24 Convolutional neural network-based cell screening method and device WO2022042506A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010869638.4A CN112037862B (en) 2020-08-26 2020-08-26 Cell screening method and device based on convolutional neural network
CN202010869638.4 2020-08-26

Publications (1)

Publication Number Publication Date
WO2022042506A1 true WO2022042506A1 (en) 2022-03-03

Family

ID=73580914

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/114165 WO2022042506A1 (en) 2020-08-26 2021-08-24 Convolutional neural network-based cell screening method and device

Country Status (2)

Country Link
CN (1) CN112037862B (en)
WO (1) WO2022042506A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112017730B (en) * 2020-08-26 2022-08-09 深圳太力生物技术有限责任公司 Cell screening method and device based on expression quantity prediction model
CN112037862B (en) * 2020-08-26 2021-11-30 深圳太力生物技术有限责任公司 Cell screening method and device based on convolutional neural network
CN112861986B (en) * 2021-03-02 2022-04-22 广东工业大学 Method for detecting blood fat subcomponent content based on convolutional neural network
CN114360652B (en) * 2022-01-28 2023-04-28 深圳太力生物技术有限责任公司 Cell strain similarity evaluation method and similar cell strain culture medium formula recommendation method

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180349770A1 (en) * 2016-02-26 2018-12-06 Google Llc Processing cell images using neural networks
CN109102515A (en) * 2018-07-31 2018-12-28 浙江杭钢健康产业投资管理有限公司 A kind of method for cell count based on multiple row depth convolutional neural networks
CN109815870A (en) * 2019-01-17 2019-05-28 华中科技大学 The high-throughput functional gene screening technique and system of cell phenotype image quantitative analysis
CN110826379A (en) * 2018-08-13 2020-02-21 中国科学院长春光学精密机械与物理研究所 Target detection method based on feature multiplexing and YOLOv3
CN112037862A (en) * 2020-08-26 2020-12-04 东莞太力生物工程有限公司 Cell screening method and device based on convolutional neural network

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040053876A1 (en) * 2002-03-26 2004-03-18 The Regents Of The University Of Michigan siRNAs and uses therof
US7655397B2 (en) * 2002-04-25 2010-02-02 The United States Of America As Represented By The Department Of Health And Human Services Selections of genes and methods of using the same for diagnosis and for targeting the therapy of select cancers
CN110992303B (en) * 2019-10-29 2023-12-22 平安科技(深圳)有限公司 Abnormal cell screening method and device, electronic equipment and storage medium
CN110838340B (en) * 2019-10-31 2020-07-10 军事科学院军事医学研究院生命组学研究所 Method for identifying protein biomarkers independent of database search

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180349770A1 (en) * 2016-02-26 2018-12-06 Google Llc Processing cell images using neural networks
CN109102515A (en) * 2018-07-31 2018-12-28 浙江杭钢健康产业投资管理有限公司 A kind of method for cell count based on multiple row depth convolutional neural networks
CN110826379A (en) * 2018-08-13 2020-02-21 中国科学院长春光学精密机械与物理研究所 Target detection method based on feature multiplexing and YOLOv3
CN109815870A (en) * 2019-01-17 2019-05-28 华中科技大学 The high-throughput functional gene screening technique and system of cell phenotype image quantitative analysis
CN112037862A (en) * 2020-08-26 2020-12-04 东莞太力生物工程有限公司 Cell screening method and device based on convolutional neural network

Also Published As

Publication number Publication date
CN112037862B (en) 2021-11-30
CN112037862A (en) 2020-12-04

Similar Documents

Publication Publication Date Title
WO2022042506A1 (en) Convolutional neural network-based cell screening method and device
WO2022042510A1 (en) Protein expression quantity prediction method and apparatus, computer device, and storage medium
Zhang et al. EfficientNet-B4-Ranger: A novel method for greenhouse cucumber disease recognition under natural complex environment
CN106529565B (en) Model of Target Recognition training and target identification method and device calculate equipment
Dittmar et al. ScreenMill: a freely available software suite for growth measurement, analysis and visualization of high-throughput screen data
CN106874688B (en) Intelligent lead compound based on convolutional neural networks finds method
CN114897779B (en) Cervical cytology image abnormal region positioning method and device based on fusion attention
WO2022042509A1 (en) Cell screening method and apparatus based on expression level prediction model
CN112101432B (en) Material microscopic image and performance bidirectional prediction method based on deep learning
US11756677B2 (en) System and method for interactively and iteratively developing algorithms for detection of biological structures in biological samples
CN111598213B (en) Network training method, data identification method, device, equipment and medium
CN108108762A (en) A kind of random forest classification method based on core extreme learning machine and parallelization for the classification of coronary heart disease data
CN111047563A (en) Neural network construction method applied to medical ultrasonic image
CN112819063B (en) Image identification method based on improved Focal loss function
CN109191434A (en) Image detecting system and detection method in a kind of cell differentiation
CN113408802B (en) Energy consumption prediction network training method and device, energy consumption prediction method and device, and computer equipment
Ray et al. Cuckoo search with differential evolution mutation and Masi entropy for multi-level image segmentation
CN115239946A (en) Small sample transfer learning training and target detection method, device, equipment and medium
CN113052217A (en) Prediction result identification and model training method and device thereof, and computer storage medium
CN111222529A (en) GoogLeNet-SVM-based sewage aeration tank foam identification method
KR101913952B1 (en) Automatic Recognition Method of iPSC Colony through V-CNN Approach
US11775822B2 (en) Classification model training using diverse training source and inference engine using same
CN113780146B (en) Hyperspectral image classification method and system based on lightweight neural architecture search
CN111862003B (en) Medical image target information acquisition method, device, equipment and storage medium
Itano et al. An automated image analysis and cell identification system using machine learning methods

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21860341

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 03/07/2023)

122 Ep: pct application non-entry in european phase

Ref document number: 21860341

Country of ref document: EP

Kind code of ref document: A1