CN114496095A - Modification site recognition method, system, device and storage medium - Google Patents

Modification site recognition method, system, device and storage medium Download PDF

Info

Publication number
CN114496095A
CN114496095A CN202210066721.7A CN202210066721A CN114496095A CN 114496095 A CN114496095 A CN 114496095A CN 202210066721 A CN202210066721 A CN 202210066721A CN 114496095 A CN114496095 A CN 114496095A
Authority
CN
China
Prior art keywords
protein
matrix
protein acetylation
acetylation
site
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210066721.7A
Other languages
Chinese (zh)
Inventor
李占潮
杨楠翔
王梦茹
彭冬冬
刘洁
胡党中
邓倩
刘雨琦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Pharmaceutical University
Original Assignee
Guangdong Pharmaceutical University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Pharmaceutical University filed Critical Guangdong Pharmaceutical University
Priority to CN202210066721.7A priority Critical patent/CN114496095A/en
Publication of CN114496095A publication Critical patent/CN114496095A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/30Detection of binding sites or motifs

Abstract

The invention discloses a method, a system, a device and a storage medium for identifying modification sites, wherein the method comprises the following steps: obtaining protein acetylation modification sites and corresponding protein acetylation amino acid sequence fragments; performing two-dimensional matrix conversion on amino acids in the protein acetylated amino acid sequence fragment; performing continuous wavelet transform operation on the protein acetylation two-dimensional matrix characteristic; and inputting the scale matrix into a deep convolution neural network model, and outputting the recognition result of the protein acetylation site. The system comprises: the device comprises a sequence acquisition module, a matrix characteristic acquisition module, a scale matrix acquisition module and an identification module. By using the method, the protein acetylation modification sites can be identified in the proteome scale, and the method has the advantages of accuracy, rapidness, low cost and the like. The invention can be widely applied to the field of recognition of the post-translational modification sites of the protein as a method, a system, a device and a storage medium for recognizing the modification sites.

Description

Modification site recognition method, system, device and storage medium
Technical Field
The invention relates to the field of protein posttranslational modification site recognition, in particular to a modification site recognition method, a system, a device and a storage medium.
Background
Acetylation of proteins is one of the post-translational modifications of proteins, and refers to the dynamic process of adding/removing acetyl groups to/from lysine residues in proteins by the action of lysine acetyltransferase/lysine deacetylase. Researches show that protein acetylation modification participates in various biological reaction processes such as metabolism, transcriptional activation, subcellular localization, protein stabilization, stress reaction and the like, and is closely related to the occurrence and development of complex and serious diseases such as cancer, diabetes, cardiovascular diseases, neurodegenerative diseases and the like. Therefore, the identification of the protein acetylation modification site is not only helpful for further elucidating the action mechanism of protein acetylation and the relationship between protein posttranslational modification and function, but also has important research significance and application value for the identification of disease-related proteins and the development of related drugs. With the advent of the post-genomic era, and the rapid development of various sequencing technologies, a great deal of protein sequence data has been identified and generated. Although high resolution mass spectrometry instruments are able to identify potential protein acetylation modification sites. But the method has the defects of time consumption, labor waste and high cost, and cannot meet the requirement of large-scale proteomics research.
Disclosure of Invention
In order to solve the above technical problems, the present invention provides a method, a system, a device and a storage medium for identifying a modification site, which can identify a protein acetylation modification site on a proteome scale, and has the advantages of accuracy, rapidness, low cost, etc.
The first technical scheme adopted by the invention is as follows: a method of modifying site recognition comprising the steps of:
obtaining protein acetylation modification sites and corresponding protein acetylation amino acid sequence fragments;
performing two-dimensional matrix conversion on amino acids in the protein acetylation amino acid sequence fragment to obtain protein acetylation two-dimensional matrix characteristics;
performing continuous wavelet transform operation on the protein acetylation two-dimensional matrix characteristics based on a wavelet transform function to obtain a scale matrix;
and inputting the scale matrix into a deep convolution neural network model, and outputting the recognition result of the protein acetylation site.
Further, the step of obtaining the protein acetylation modification site and the corresponding protein acetylation amino acid sequence fragment specifically includes:
obtaining protein acetylation modification sites;
taking a protein acetylation modification site as a center, intercepting sequence segments respectively comprising 10 amino acid residues at the upstream and the downstream to obtain corresponding protein acetylation amino acid sequence segments.
Further, the step of performing two-dimensional matrix transformation on amino acids in the protein acetylated amino acid sequence fragment to obtain a protein acetylated two-dimensional matrix characteristic specifically includes:
respectively defining 20 natural amino acid residues as a two-dimensional characteristic matrix with the size of 7 multiplied by 5 to obtain a defined matrix;
and converting the corresponding protein acetylated amino acid sequence fragments into protein acetylated two-dimensional matrix characteristics according to the definition matrix.
Further, the step of performing continuous wavelet transform operation on the protein acetylation two-dimensional matrix characteristics based on the wavelet transform function to obtain a scale matrix specifically includes:
presetting a wavelet transformation function and a scale;
sequentially changing wavelet transformation scales and performing continuous wavelet transformation according to the wavelet transformation function and the row characteristics of the protein acetylation two-dimensional matrix characteristics to obtain a corresponding transformation coefficient under each scale;
and constructing a scale matrix by taking the corresponding transformation coefficient under each scale as a row vector.
Further, the building step of the deep convolutional neural network model specifically comprises:
collecting protein acetylation site data information, and processing by adopting a predefined amino acid two-dimensional characteristic matrix, a wavelet transformation function and a scale process to obtain a first training scale matrix;
obtaining a non-protein acetylation modification site sequence according to the collected protein acetylation site data information;
for a non-protein acetylation modification site sequence, processing by adopting a predefined amino acid two-dimensional characteristic matrix, a wavelet transformation function and a scale process to obtain a second training scale matrix;
taking the first training scale matrix as an input data positive sample and the second training scale matrix as an input data negative sample;
extracting a positive sample of input data, a corresponding positive sample mark of output data, a negative sample of input data and a corresponding negative sample mark of output data in a preset proportion, and taking the positive sample, the corresponding positive sample mark, the negative sample and the corresponding negative sample mark of the output data as a training set to train the deep convolutional neural network model;
and extracting input data positive samples, corresponding output data positive sample marks, input data negative samples and corresponding output data negative sample marks in a preset proportion to serve as a test set to test the deep convolutional neural network model.
Further, the step of obtaining a non-protein acetylation modification site sequence according to the collected data information of the protein acetylation site specifically includes:
searching the protein containing the protein acetylation modification site according to the collected data information of the protein acetylation site to obtain a lysine residue which is not annotated as the acetylation site;
taking a lysine residue which is not annotated as an acetylation site as a center, and intercepting sequence segments respectively comprising 10 amino acid residues at the upstream and the downstream to obtain a non-protein acetylation modification site sequence.
The second technical scheme adopted by the invention is as follows: a modification site recognition system comprising:
the sequence acquisition module is used for acquiring protein acetylation modification sites and corresponding protein acetylation amino acid sequence fragments;
the matrix characteristic acquisition module is used for carrying out two-dimensional matrix conversion on amino acids in the protein acetylation amino acid sequence fragments to obtain protein acetylation two-dimensional matrix characteristics;
the scale matrix acquisition module is used for carrying out continuous wavelet transform operation on the protein acetylation two-dimensional matrix characteristics based on a wavelet transform function to obtain a scale matrix;
and the recognition module is used for inputting the scale matrix into the deep convolutional neural network model and outputting the recognition result of the protein acetylation site.
The third technical scheme adopted by the invention is as follows: a modification site recognition device comprising:
at least one processor;
at least one memory for storing at least one program;
when executed by the at least one processor, cause the at least one processor to implement a modification site recognition method as described above.
The fourth technical scheme adopted by the invention is as follows: a storage medium having stored therein instructions executable by a processor, the storage medium comprising: the processor-executable instructions, when executed by the processor, are for implementing a modification site recognition method as described above.
The method, the system, the device and the storage medium have the advantages that: the method is based on the deep learning theory, adopts a data processing method to construct a theoretical model, and can quickly, accurately and efficiently identify the protein acetylation modification sites in the proteome scale, thereby being beneficial to the research of the protein acetylation modification mechanism and the identification of the post-translational modification sites of the disease-related proteins.
Drawings
FIG. 1 is a flow chart of the steps of a method of identifying a modification site according to the present invention;
FIG. 2 is a block diagram showing the structure of a modification site recognition system according to the present invention.
Detailed Description
The invention is described in further detail below with reference to the figures and the specific embodiments. The step numbers in the following embodiments are provided only for convenience of illustration, the order between the steps is not limited at all, and the execution order of each step in the embodiments can be adapted according to the understanding of those skilled in the art.
As shown in FIG. 1, the present invention provides a modification site recognition method comprising the steps of:
s1, acquiring protein acetylation modification sites and corresponding protein acetylation amino acid sequence fragments;
s1.1, obtaining protein acetylation modification sites;
s1.2, taking the protein acetylation modification site as a center, intercepting sequence segments respectively comprising 10 amino acid residues at the upstream and the downstream to obtain corresponding protein acetylation amino acid sequence segments.
S2, performing two-dimensional matrix transformation on amino acids in the protein acetylation amino acid sequence fragment to obtain protein acetylation two-dimensional matrix characteristics;
s2.1, respectively defining 20 natural amino acid residues as two-dimensional characteristic matrixes with the size of 7 multiplied by 5 to obtain defined matrixes;
specifically, defining the matrix includes the following:
cysteine C is represented as:
Figure BDA0003480485910000041
aspartic acid D is represented as:
Figure BDA0003480485910000042
lysine K is represented as:
Figure BDA0003480485910000043
tryptophan W is represented as:
Figure BDA0003480485910000044
Figure BDA0003480485910000051
tyrosine Y is represented as:
Figure BDA0003480485910000052
and S2.2, converting the corresponding protein acetylated amino acid sequence fragment into a protein acetylated two-dimensional matrix characteristic according to the defined matrix.
The following amino acid sequence fragments are taken as examples: tryptophan W, cysteine C, lysine K, aspartic acid D and tyrosine Y (abbreviation: WCKDY), which show the following transformation processes:
splicing the W, C, K, D and Y two-dimensional characteristic matrixes left and right according to the sequence of amino acid participating in the sequence fragment, and converting into protein acetylation two-dimensional matrix characteristics (with 7 rows and 25 columns):
[1 0 0 0 1 0 1 1 1 0 1 0 0 0 1 1 1 1 1 0 1 0 0 0 1
1 0 0 0 1 1 0 0 0 1 1 0 0 1 0 1 0 0 0 1 1 0 0 0 1
1 0 0 0 1 1 0 0 0 0 1 0 1 0 0 1 0 0 0 1 0 1 0 1 0
1 0 0 0 1 1 0 0 0 0 1 1 0 0 0 1 0 0 0 1 0 0 1 0 0
1 0 1 0 1 1 0 0 0 0 1 0 1 0 0 1 0 0 0 1 0 0 1 0 0
1 1 0 1 1 1 0 0 0 1 1 0 0 1 0 1 0 0 0 1 0 0 1 0 0
1 0 0 0 1 0 1 1 1 0 1 0 0 0 1 1 1 1 1 0 0 0 1 0 0]
s3, performing continuous wavelet transform operation on the protein acetylation two-dimensional matrix characteristics based on a wavelet transform function to obtain a scale matrix;
s3.1, presetting a wavelet transformation function and a scale;
specifically, the wavelet transform function formula is expressed as follows:
Figure BDA0003480485910000053
in the above formula, CWTf(a, b) are continuous wavelet transform coefficients, ψ is a wavelet function, f (t) is each row characteristic of a two-dimensional matrix characteristic of protein acetylation modification sites, a is a scale parameter, and b is a displacement factor. The wavelet function is defined as follows:
Figure BDA0003480485910000061
wherein, ω is0The center frequency.
Setting f (t) as the first row of the protein acetylation two-dimensional matrix features, i.e., f (t) ═ 1000101110100011111010001;
preferably, the f (t) feature is subjected to a continuous wavelet transform using the following formula, where a is 1,2, … …,48, and b is 1.
Figure BDA0003480485910000062
Preferably, in the continuous wavelet transform formula, the wavelet function is defined as follows:
Figure BDA0003480485910000063
wherein ω is0=3。
S3.2, sequentially changing wavelet transformation scales and carrying out continuous wavelet transformation according to the wavelet transformation function and the row characteristics of the protein acetylation two-dimensional matrix characteristics to obtain a corresponding transformation coefficient under each scale;
and S3.3, constructing a scale matrix by taking the corresponding transformation coefficient under each scale as a row vector.
And S4, inputting the scale matrix into the deep convolutional neural network model, and outputting the recognition result of the protein acetylation site.
Further as a preferred embodiment of the method, the step of constructing the deep convolutional neural network model specifically includes:
collecting protein acetylation site data information, and processing by adopting a predefined amino acid two-dimensional characteristic matrix, a wavelet transformation function and a scale process to obtain a first training scale matrix;
specifically, the data related to the Protein acetylation modification sites is obtained by screening data in a UniProtKB (the Universal Protein Resource knowledge database) database, and the specific steps are as follows: collecting protein information marked as human in a database (namely, Homo sapiens); human acetylated proteins annotated as acetylated (i.e., N6-acetyllysine) at lysine residues were collected. A lysine is an acetylation modification site if each lysine in the amino acid sequence of the human protein is annotated by "N6-acetyllysine"; collecting the amino acid sequence information of the acetylation modification sites of the protein.
Obtaining a non-protein acetylation modification site sequence according to the collected protein acetylation site data information;
for a non-protein acetylation modification site sequence, processing by adopting a predefined amino acid two-dimensional characteristic matrix, a wavelet transformation function and a scale process to obtain a second training scale matrix;
taking the first training scale matrix as an input data positive sample and the second training scale matrix as an input data negative sample;
extracting 80% of input data positive samples, corresponding output data positive sample labels, input data negative samples and corresponding output data negative sample labels to serve as a training set to train the deep convolutional neural network model;
and extracting 20% of input data positive samples, corresponding output data positive sample marks, input data negative samples and corresponding output data negative sample marks to be used as a test set to test the deep convolutional neural network model.
The deep convolutional neural network model adopts the following architecture and parameters: input layer (48 × 105), convolutional layer (32 5 × 5 filters, step size 2 × 2), active layer (ReLU), relaxation layer (Max relaxation, size step 2 × 2, step size 2 × 2), convolutional layer (64 5 × 5 filters, step size 2 × 2), active layer (ReLU), relaxation layer (Max relaxation, size step 2 × 2, step size 2 × 2), convolutional layer (128 5 × 5 filters, step size 2 × 2), active layer (ReLU), relaxation layer (Max relaxation, size step size 2 × 2, step size 2 × 2), fully connected layer (200 neurons, drout 0.5), fully connected layer (2 neurons), softmax layer, output layer (2 neurons). The stochastic gradient descent optimizer, momentum, learning rate and maximum training theoretic number were set to 0.9, 0.01 and 50, respectively. Results as shown in table 1, the training set achieved an overall accuracy of 60.20%, a sensitivity of 58.37%, a specificity of 62.03%, a mahalanobis correlation coefficient of 0.2041, and a subject work curve area of 0.3528, respectively. The test set yielded an overall accuracy of 61.25%, a sensitivity of 59.76%, a specificity of 62.74%, a mahalanobis correlation coefficient of 0.2250 and an area under the subject's working curve of 0.3388, respectively. These results demonstrate that the present embodiments are able to identify potential protein acetylation modification sites.
Further as a preferred embodiment of the method, the step of obtaining a sequence of a non-protein acetylation modification site according to the collected data information of the protein acetylation site specifically includes:
searching the protein containing the protein acetylation modification site according to the collected data information of the protein acetylation site to obtain a lysine residue which is not annotated as the acetylation site;
taking a lysine residue which is not annotated as an acetylation site as a center, and intercepting sequence segments respectively comprising 10 amino acid residues at the upstream and the downstream to obtain a non-protein acetylation modification site sequence.
As shown in fig. 2, a modification site recognition system comprising:
the sequence acquisition module is used for acquiring protein acetylation modification sites and corresponding protein acetylation amino acid sequence fragments;
the matrix characteristic acquisition module is used for performing two-dimensional matrix conversion on amino acids in the protein acetylation amino acid sequence fragment to obtain protein acetylation two-dimensional matrix characteristics;
the scale matrix acquisition module is used for carrying out continuous wavelet transform operation on the protein acetylation two-dimensional matrix characteristics based on a wavelet transform function to obtain a scale matrix;
and the recognition module is used for inputting the scale matrix into the deep convolutional neural network model and outputting the recognition result of the protein acetylation site.
The contents in the above method embodiments are all applicable to the present system embodiment, the functions specifically implemented by the present system embodiment are the same as those in the above method embodiment, and the beneficial effects achieved by the present system embodiment are also the same as those achieved by the above method embodiment.
A modification site recognition device:
at least one processor;
at least one memory for storing at least one program;
when executed by the at least one processor, cause the at least one processor to implement a modification site recognition method as described above.
The contents in the above method embodiments are all applicable to the present apparatus embodiment, the functions specifically implemented by the present apparatus embodiment are the same as those in the above method embodiments, and the advantageous effects achieved by the present apparatus embodiment are also the same as those achieved by the above method embodiments.
A storage medium having stored therein instructions executable by a processor, the storage medium comprising: the processor-executable instructions, when executed by the processor, are for implementing a modification site recognition method as described above.
The contents in the above method embodiments are all applicable to the present storage medium embodiment, the functions specifically implemented by the present storage medium embodiment are the same as those in the above method embodiments, and the advantageous effects achieved by the present storage medium embodiment are also the same as those achieved by the above method embodiments.
While the preferred embodiments of the present invention have been illustrated and described, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (9)

1. A method for identifying a modification site, comprising the steps of:
obtaining protein acetylation modification sites and corresponding protein acetylation amino acid sequence fragments;
performing two-dimensional matrix conversion on amino acids in the protein acetylation amino acid sequence fragment to obtain protein acetylation two-dimensional matrix characteristics;
performing continuous wavelet transform operation on the protein acetylation two-dimensional matrix characteristics based on a wavelet transform function to obtain a scale matrix;
and inputting the scale matrix into a deep convolution neural network model, and outputting the recognition result of the protein acetylation site.
2. The method for identifying a modification site according to claim 1, wherein the step of obtaining the acetylated modification site of the protein and the corresponding acetylated amino acid sequence fragment of the protein comprises:
obtaining protein acetylation modification sites;
taking a protein acetylation modification site as a center, intercepting sequence segments respectively comprising 10 amino acid residues at the upstream and the downstream to obtain corresponding protein acetylation amino acid sequence segments.
3. The method for identifying modification sites according to claim 2, wherein the step of performing two-dimensional matrix transformation on amino acids in the protein acetylated amino acid sequence fragment to obtain the protein acetylated two-dimensional matrix features specifically comprises:
respectively defining 20 natural amino acid residues as a two-dimensional characteristic matrix with the size of 7 multiplied by 5 to obtain a defined matrix;
and converting the corresponding protein acetylated amino acid sequence fragments into protein acetylated two-dimensional matrix characteristics according to the definition matrix.
4. The method for identifying acetylated protein modification sites of claim 3, wherein the step of performing continuous wavelet transform on the acetylated protein two-dimensional matrix features based on a wavelet transform function to obtain a scale matrix specifically comprises:
presetting a wavelet transformation function and a scale;
sequentially changing wavelet transformation scales and performing continuous wavelet transformation according to the wavelet transformation function and the row characteristics of the protein acetylation two-dimensional matrix characteristics to obtain a corresponding transformation coefficient under each scale;
and constructing a scale matrix by taking the corresponding transformation coefficient under each scale as a row vector.
5. The method for identifying modification sites according to claim 4, wherein the step of constructing the deep convolutional neural network model specifically comprises:
collecting protein acetylation site data information, and processing by adopting a predefined amino acid two-dimensional characteristic matrix, a wavelet transformation function and a scale process to obtain a first training scale matrix;
obtaining a non-protein acetylation modification site sequence according to the collected protein acetylation site data information;
for a non-protein acetylation modification site sequence, processing by adopting a predefined amino acid two-dimensional characteristic matrix, a wavelet transformation function and a scale process to obtain a second training scale matrix;
taking the first training scale matrix as an input data positive sample and the second training scale matrix as an input data negative sample;
extracting a positive sample of input data, a corresponding positive sample mark of output data, a negative sample of input data and a corresponding negative sample mark of output data in a preset proportion, and taking the positive sample, the corresponding positive sample mark, the negative sample and the corresponding negative sample mark of the output data as a training set to train the deep convolutional neural network model;
and extracting a positive sample of input data, a corresponding positive sample mark of output data, a negative sample of input data and a corresponding negative sample mark of output data in a preset proportion, and using the positive sample, the corresponding positive sample mark, the corresponding negative sample and the corresponding negative sample mark of the output data as a test set to test the deep convolutional neural network model.
6. The method for identifying a modification site according to claim 5, wherein the step of obtaining the sequence of the non-protein acetylation modification site according to the collected data information of the protein acetylation site comprises:
searching the protein containing the protein acetylation modification site according to the collected data information of the protein acetylation site to obtain a lysine residue which is not annotated as the acetylation site;
taking a lysine residue which is not annotated as an acetylation site as a center, and intercepting sequence segments respectively comprising 10 amino acid residues at the upstream and the downstream to obtain a non-protein acetylation modification site sequence.
7. A modification site recognition system comprising:
the sequence acquisition module is used for acquiring protein acetylation modification sites and corresponding protein acetylation amino acid sequence fragments;
the matrix characteristic acquisition module is used for carrying out two-dimensional matrix conversion on amino acids in the protein acetylation amino acid sequence fragments to obtain protein acetylation two-dimensional matrix characteristics;
the scale matrix acquisition module is used for carrying out continuous wavelet transform operation on the protein acetylation two-dimensional matrix characteristics based on a wavelet transform function to obtain a scale matrix;
and the recognition module is used for inputting the scale matrix into the deep convolutional neural network model and outputting the recognition result of the protein acetylation site.
8. A modification site recognition device, comprising:
at least one processor;
at least one memory for storing at least one program;
when executed by the at least one processor, cause the at least one processor to implement a method for recognition of acetylated modification sites according to any one of claims 1 to 6.
9. A storage medium having stored therein instructions executable by a processor, the storage medium comprising: the processor-executable instructions, when executed by a processor, are for implementing a modification site recognition method according to any one of claims 1 to 6.
CN202210066721.7A 2022-01-20 2022-01-20 Modification site recognition method, system, device and storage medium Pending CN114496095A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210066721.7A CN114496095A (en) 2022-01-20 2022-01-20 Modification site recognition method, system, device and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210066721.7A CN114496095A (en) 2022-01-20 2022-01-20 Modification site recognition method, system, device and storage medium

Publications (1)

Publication Number Publication Date
CN114496095A true CN114496095A (en) 2022-05-13

Family

ID=81473241

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210066721.7A Pending CN114496095A (en) 2022-01-20 2022-01-20 Modification site recognition method, system, device and storage medium

Country Status (1)

Country Link
CN (1) CN114496095A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114927165A (en) * 2022-07-20 2022-08-19 深圳大学 Method, device, system and storage medium for identifying ubiquitination sites

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114927165A (en) * 2022-07-20 2022-08-19 深圳大学 Method, device, system and storage medium for identifying ubiquitination sites
WO2024016389A1 (en) * 2022-07-20 2024-01-25 深圳大学 Ubiquitination site identification method, apparatus and system, and storage medium

Similar Documents

Publication Publication Date Title
Gamboa et al. Validation of the rapid detection approach for enhancing the electronic nose systems performance, using different deep learning models and support vector machines
Gorodkin Comparing two K-category assignments by a K-category correlation coefficient
US7561975B2 (en) System, method, and computer program product for analyzing spectrometry data to identify and quantify individual components in a sample
Dean et al. Machine learning techniques for chemical identification using cyclic square wave voltammetry
CN104063714B (en) A kind of for fast face recognizer video monitoring, based on CUDA parallel computation and rarefaction representation
CN110349628B (en) Protein phosphorylation site recognition method, system, device and storage medium
US20190265319A1 (en) System and method for small molecule accurate recognition technology ("smart")
CN112507606A (en) Method for identifying underdetermined working modal parameters based on RBF network and detection method
CN114496095A (en) Modification site recognition method, system, device and storage medium
van den Berg et al. SPiCE: a web-based tool for sequence-based protein classification and exploration
Caiafa et al. Machine learning methods with noisy, incomplete or small datasets
Liu et al. Why can deep convolutional neural networks improve protein fold recognition? A visual explanation by interpretation
Fuhrmann et al. Software for automated analysis of DNA fingerprinting gels
Gruenstaeudl et al. Bioinformatic workflows for generating complete plastid genome sequences—an example from Cabomba (Cabombaceae) in the context of the phylogenomic analysis of the water-lily clade
Chen et al. Affnet: an attention-based feature-fused network for surface defect segmentation
Murphy et al. Self-supervised learning of cell type specificity from immunohistochemical images
DeLuca et al. Data processing and analysis for protein microarrays
Zhang et al. PHCNet: Pyramid Hierarchical-Convolution-Based U-Net for Crack Detection with Mixed Global Attention Module and Edge Feature Extractor
Li et al. Multidimensional scaling method for prediction of lysine glycation sites
Ampong et al. Optimization of imputation strategies for high-resolution gas chromatography–mass spectrometry (HR GC–MS) metabolomics data
CN113160886A (en) Cell type prediction system based on single cell Hi-C data
Xu et al. Scale-adaptive deep matching network for constrained image splicing detection and localization
Zacher et al. Analysis of Affymetrix ChIP-chip data using starr and R/Bioconductor
US20040029126A1 (en) Method For examining macromolecules
Love Statistical Modeling of High Dimensional Counts

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination