CN114998332A - Method and device for determining karyotype abnormal cells - Google Patents

Method and device for determining karyotype abnormal cells Download PDF

Info

Publication number
CN114998332A
CN114998332A CN202210918890.9A CN202210918890A CN114998332A CN 114998332 A CN114998332 A CN 114998332A CN 202210918890 A CN202210918890 A CN 202210918890A CN 114998332 A CN114998332 A CN 114998332A
Authority
CN
China
Prior art keywords
feature
karyotype
cell
target
candidate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210918890.9A
Other languages
Chinese (zh)
Other versions
CN114998332B (en
Inventor
邝英兰
何妙燕
范献军
叶莘
陈成苑
陈鑫
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhuhai Hengqin Shengao Yunzhi Technology Co ltd
Original Assignee
Zhuhai Hengqin Shengao Yunzhi Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhuhai Hengqin Shengao Yunzhi Technology Co ltd filed Critical Zhuhai Hengqin Shengao Yunzhi Technology Co ltd
Priority to CN202210918890.9A priority Critical patent/CN114998332B/en
Publication of CN114998332A publication Critical patent/CN114998332A/en
Application granted granted Critical
Publication of CN114998332B publication Critical patent/CN114998332B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • G06T7/0012Biomedical image inspection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/44Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/56Extraction of image or video features relating to colour
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/69Microscopic objects, e.g. biological cells or cellular parts
    • G06V20/695Preprocessing, e.g. image segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/69Microscopic objects, e.g. biological cells or cellular parts
    • G06V20/698Matching; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10056Microscopic image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10064Fluorescence image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30004Biomedical image processing
    • G06T2207/30024Cell structures in vitro; Tissue sections in vitro
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30004Biomedical image processing
    • G06T2207/30096Tumor; Lesion

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Radiology & Medical Imaging (AREA)
  • Quality & Reliability (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

The application provides a method and a device for determining karyotype abnormal cells, wherein the method comprises the following steps: determining a target karyotype association feature value corresponding to the cell to be identified based on a cell nucleus microscopic image of the cell to be identified, wherein the target karyotype association feature is obtained by screening from an initial karyotype association feature set based on a Relieff algorithm and an mrMR algorithm, and secondarily screening and determining a candidate karyotype association feature set based on the mrMR algorithm and a backward selection method, the initial karyotype association feature set comprises morphological features, texture features, color features and an edge feature subset, the candidate karyotype association feature set comprises at least one candidate morphological feature, and inputting the target karyotype association characteristic value corresponding to the cell to be identified into the trained karyotype abnormal cell determination model, and outputting a karyotype abnormal cell identification result, so that the efficiency and accuracy of karyotype abnormal cell identification are ensured.

Description

Method and device for determining karyotype abnormal cells
Technical Field
The present application relates to the field of medical image processing technologies, and in particular, to a method and an apparatus for determining a karyotype abnormal cell.
Background
The target cells currently used for tumor-related cell interpretation in blood are monocytes and non-granulocytes, and therefore, before tumor-related cell interpretation is performed, non-interpretation target cells in a blood sample need to be filtered out to eliminate the influence on the accuracy of interpretation results. However, due to the influence of environmental factors, some of the non-interpretation target cells in the blood sample cannot be completely filtered out.
In order to avoid the influence of the non-interpretation target cells on the interpretation result, the prior art generally determines whether the cell karyotype is abnormal by manual observation, and excludes the cell karyotype when the tumor-related cell interpretation is performed. However, this method is time-consuming and labor-consuming, and has low accuracy, and cannot ensure the efficiency and accuracy of tumor-related cell interpretation.
Disclosure of Invention
The application provides a method and a device for identifying karyotype abnormal cells, which are used for improving the identification efficiency and accuracy of the karyotype abnormal cells and ensuring the interpretation efficiency and accuracy of tumor-related cells.
The application provides a method for determining karyotype abnormal cells, which comprises the following steps:
determining a target karyotype association characteristic value corresponding to the cell to be identified based on the cell nucleus microscopic image corresponding to the cell to be identified; the target karyotype association feature is determined after a candidate karyotype association feature set is obtained by screening from an initial karyotype association feature set based on a Relieff algorithm and a maximum correlation minimum redundancy (mRMR) algorithm, and then secondary screening is carried out on the candidate karyotype association feature set based on an mRMR algorithm and a backward selection method; the initial karyotype associated feature set comprises a morphological feature subset, a texture feature subset, a color feature subset and an edge feature subset, and the candidate karyotype associated feature set comprises at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature;
inputting the target karyotype association characteristic value corresponding to the cell to be identified into a trained karyotype abnormal cell determination model, and outputting a karyotype abnormal cell identification result corresponding to the target karyotype association characteristic value;
the karyotype abnormal cell determination model is obtained by training a target karyotype correlation characteristic value sample and a predetermined karyotype abnormal cell identification result label.
According to the method for determining the karyotype abnormal cell provided by the application, the step of acquiring the cell nucleus microscopic image corresponding to the cell to be identified comprises the following steps:
segmenting the cell microscopic image corresponding to the blood sample based on a preset segmentation network to obtain an initial cell nucleus microscopic image corresponding to the blood sample;
determining candidate cell nucleuses with over segmentation based on the position relation of each cell nucleus and the number of fluorescence signal points in the cell nucleus in initial cell nucleus microscopic images corresponding to different fluorescence channels of the blood sample, and correcting the candidate cell nucleuses based on a convex hull detection algorithm and a convex defect algorithm to obtain cell nucleus corrected microscopic images corresponding to the blood sample;
and taking the cell nucleus correction microscopic image corresponding to the blood sample as the cell nucleus microscopic image corresponding to the cell to be identified.
According to the method for determining the karyotype abnormal cells, the step of correcting the candidate cell nucleus based on the convex hull detection algorithm and the convex defect algorithm specifically comprises the following steps:
determining a cell nucleus set formed by target cell nuclei of which the sum of the number of the fluorescence signal points is a preset value in the candidate cell nuclei;
merging the target cell nuclei in the cell nucleus set to obtain corresponding corrected cell nuclei;
determining a contour point set which is farthest from a convex hull on the contour of the modified cell nucleus based on a convex hull detection algorithm and a convex defect algorithm, and determining a target contour point in the contour point set based on a preset distance threshold, wherein the target contour point is a contour point in the contour point set, the distance between the contour point set and the convex hull of which exceeds the preset distance threshold;
determining whether the corrected cell nucleus is correct based on the number of the target contour points, and deleting the incorrect corrected cell nucleus.
According to the method for determining the karyotype abnormal cells, the candidate karyotype association feature set is obtained by screening from the initial karyotype association feature set based on the ReliefF algorithm and the maximum correlation minimum redundancy mRMR algorithm, and the method specifically comprises the following steps:
respectively performing feature screening on each feature subset in the initial karyotype associated feature set based on a Relieff algorithm and an mrMR algorithm to obtain at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature;
and combining the candidate morphological characteristics, the candidate texture characteristics, the candidate color characteristics and the candidate edge characteristics to obtain the candidate karyotype association characteristic set.
According to the method for determining the karyotype abnormal cells, the feature screening is respectively performed on each feature subset in the initial karyotype-associated feature set based on the ReliefF algorithm and the mRMR algorithm, and the method specifically includes:
determining a target feature subset, and acquiring a feature value corresponding to each sample in a training sample set and the target feature subset;
determining the weight corresponding to each feature in the target feature subset through a Relieff algorithm based on the feature value corresponding to each sample in the training sample set and the target feature subset;
screening the features in the target feature subset based on the weight threshold corresponding to the target feature subset, and determining a first feature combination corresponding to the target feature subset;
and determining first average mutual information between each feature and the class C in the first feature combination and second average mutual information between each feature based on an mRMR algorithm, and screening the features in the first feature combination based on the first average mutual information and the second average mutual information to obtain a second feature combination corresponding to the target feature subset.
According to the method for determining karyotypic abnormal cells provided by the application, the features in the second feature combination are the features in the first feature combination, which maximize the difference or quotient between the first average mutual information and the second average mutual information.
According to the method for determining the karyotype abnormal cells, provided by the application, the secondary screening is performed on the candidate karyotype association feature set based on the mRMR algorithm and the backward selection method, and specifically the method comprises the following steps:
re-screening the features in the candidate karyotype associated feature set based on an mRMR algorithm, and determining a potential karyotype associated feature set;
and selecting the features in the potential karyotype association feature set based on a backward selection method, and determining the target karyotype association features.
According to the method for determining karyotype abnormal cells provided by the application, determining the weight corresponding to each feature in the target feature subset through a ReliefF algorithm based on the feature value corresponding to each sample in the training sample set and the target feature subset includes:
determining the difference between the feature values corresponding to the target sample and the similar nearest neighbor samples with the preset number and the difference between the feature values corresponding to the target sample and the different nearest neighbor samples with the preset number based on the feature values corresponding to the samples in the training sample set and the target feature subset;
and updating the weight corresponding to each feature in the target feature subset based on the difference between the feature values corresponding to the target sample and the similar nearest neighbor samples with the preset number and the difference between the feature values corresponding to the target sample and the different nearest neighbor samples with the preset number until the updating times reach a preset threshold value.
According to the determination method of the karyotype abnormal cell provided by the application, the target karyotype-related characteristic comprises: the cell nucleus perimeter, the cell nucleus area, the cell nucleus circularity, the cell nucleus elongation, the cell nucleus aspect ratio, the cell nucleus rectangularity, the cell nucleus firmness, the cell nucleus orientation degree, the cell nucleus long axis, the cell nucleus short axis, the cell nucleus microscopic image sharpness, the cell nucleus microscopic image total signal intensity, the cell nucleus microscopic image average signal intensity, the cell nucleus microscopic image signal intensity variance and the cell nucleus microscopic image corresponding Zernike moment.
The present application also provides an apparatus for determining a karyotype abnormal cell, including:
the target karyotype association characteristic value determining module is used for determining a target karyotype association characteristic value corresponding to the cell to be identified based on the cell nucleus microscopic image corresponding to the cell to be identified; the target karyotype correlation characteristics are determined after screening a candidate karyotype correlation characteristic set from an initial karyotype correlation characteristic set based on a Relieff algorithm and a maximum correlation minimum redundancy (mRMR) algorithm, and performing secondary screening on the candidate karyotype correlation characteristic set based on an mRMR algorithm and a backward selection method; the initial karyotype associated feature set comprises a morphological feature subset, a texture feature subset, a color feature subset and an edge feature subset, and the candidate karyotype associated feature set comprises at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature;
the karyotype abnormal cell determination module is used for inputting the target karyotype associated characteristic value corresponding to the cell to be identified into the trained karyotype abnormal cell determination model and outputting a karyotype abnormal cell identification result corresponding to the target karyotype associated characteristic value;
the karyotype abnormal cell determination model is obtained by training a target karyotype correlation characteristic value sample and a predetermined karyotype abnormal cell identification result label.
The present application further provides an electronic device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor implements the steps of the method for acquiring the karyotype abnormal cell determination described in any one of the above methods when executing the program.
The present application also provides a non-transitory computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the method for determining a karyotypically abnormal cell as described in any one of the above.
The present application also provides a computer program product comprising a computer program which, when executed by a processor, implements the steps of the method for determining a karyotypically abnormal cell as described in any one of the above.
The method and the device for determining the abnormal karyotype cells provided by the application determine target karyotype associated feature values corresponding to the cells to be identified based on a nuclear microscopic image corresponding to the cells to be identified, wherein the target karyotype associated features are determined by screening a candidate karyotype associated feature set from an initial karyotype associated feature set based on a Relieff algorithm and a maximum correlation minimum redundancy (mRMR) algorithm, and then determining the candidate karyotype associated feature set after secondary screening based on an mRMR algorithm and a backward selection method, wherein the initial karyotype associated feature set comprises a morphological feature subset, a texture feature subset, a color feature subset and an edge feature subset, the candidate karyotype associated feature set comprises at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature, and the target karyotype associated feature values corresponding to the cells to be identified are input into a trained karyotype abnormal cell determination model And outputting a karyotype abnormal cell identification result corresponding to the target karyotype correlation characteristic value, wherein the karyotype abnormal cell determination model is obtained after training based on a target karyotype correlation characteristic value sample and a predetermined karyotype abnormal cell identification result label, high-precision identification of the karyotype abnormal cell can be realized by using the minimum target karyotype correlation characteristic, meanwhile, the determination process of the karyotype abnormal cell is automatically completed through the model, the efficiency and accuracy of the karyotype abnormal cell identification are ensured, and further the efficiency and accuracy of tumor-related cell interpretation are ensured.
Drawings
In order to more clearly illustrate the technical solutions in the present application or the prior art, the drawings needed for the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present application, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without creative efforts.
FIG. 1 is a schematic flow chart of a method for determining karyotypic abnormal cells as provided herein;
FIG. 2 is a schematic illustration of the nuclear morphology of a karyotype normal cell provided herein;
FIG. 3 is a schematic representation of the nuclear morphology of the karyotypically abnormal cells provided herein;
FIG. 4 is a schematic structural diagram of an apparatus for determining karyotype abnormal cells provided in the present application;
fig. 5 is a schematic structural diagram of an electronic device provided in the present application.
Detailed Description
To make the purpose, technical solutions and advantages of the present application clearer, the technical solutions in the present application will be clearly and completely described below with reference to the drawings in the present application, and it is obvious that the described embodiments are some, but not all embodiments of the present application. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
In the prior art, when the interpretation of tumor-related cells in blood is performed, in order to ensure the interpretation accuracy, a preset number of images of the interpretation target cells (i.e., monocytes and granulocytes) need to be obtained through a microscope, and then a corresponding interpretation process is performed. However, due to factors such as environment, some of the non-interpretative target cells in the blood sample, including eosinophils, basophils, and neutrophils, cannot be completely filtered out. However, when the microscope collects the cell image, it is not possible to automatically identify the non-interpretation target cell, and it is necessary to judge whether the cell karyotype in the cell image is abnormal by manual observation, and further determine whether the cell karyotype is a non-interpretation target cell, and exclude the cell karyotype when the tumor-related cell interpretation is performed. The method is time-consuming and labor-consuming, has low accuracy, and cannot ensure the efficiency and accuracy of subsequent tumor-related cell interpretation. Based on this, the embodiment of the application provides a method and a device for determining a karyotype abnormal cell, so as to accurately and efficiently determine the karyotype abnormal cell, and further ensure efficiency and accuracy of tumor-related cell interpretation.
Fig. 1 is a schematic flow chart of a method for determining a karyotypic abnormal cell provided in the present application, as shown in fig. 1, the method includes:
step 110, determining a target karyotype correlation characteristic value corresponding to a cell to be identified based on a cell nucleus microscopic image corresponding to the cell to be identified; the target karyotype correlation characteristics are determined after screening a candidate karyotype correlation characteristic set from an initial karyotype correlation characteristic set based on a Relieff algorithm and a maximum correlation minimum redundancy (mRMR) algorithm, and performing secondary screening on the candidate karyotype correlation characteristic set based on an mRMR algorithm and a backward selection method; the initial karyotype associated feature set comprises a morphological feature subset, a texture feature subset, a color feature subset and an edge feature subset, and the candidate karyotype associated feature set comprises at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature.
Specifically, the cell to be identified is a cell corresponding to a blood sample, in the embodiment of the present application, a cell microscopic image corresponding to the blood sample is firstly acquired through a microscope, and the number of the cell to be identified in the cell microscopic image is determined by the resolution of a microscope acquisition camera. The blood sample is stained in advance by a fluorescence in situ hybridization technique, and correspondingly, the cell microscopic image corresponds to four fluorescence channels of red, green, blue and gold. After the cell microscopic image is obtained, the cell nucleus in the cell microscopic image needs to be segmented to obtain a cell nucleus microscopic image corresponding to the cell to be identified. After obtaining the nuclear microscopic image corresponding to the cell to be identified, the determining device for the abnormal karyotype cell may determine the target karyotype associated characteristic value corresponding to the cell to be identified based on a predetermined characteristic extraction algorithm.
The prior art manually determines the karyotype abnormal cells by the following method: the shape of the cell nucleus in the cell nucleus microscopic image is visually judged by visual observation, and the method can roughly distinguish granulocytes from non-granulocytes so as to determine the abnormal karyotype cells. Fig. 2 is a schematic diagram of the nuclear morphology of the karyotype normal cells provided by the present application, and fig. 3 is a schematic diagram of the nuclear morphology of the karyotype abnormal cells provided by the present application, as shown in fig. 2-3, the nuclear shape of the karyotype normal cells (i.e., the anucleate cells) is approximately circular, and the nuclear shape of the karyotype abnormal cells (i.e., the granulocytes) is petal-shaped. However, due to the influence of the environment, the resolution of a microscope acquisition camera and other factors, the morphological difference between the nuclei of the normal karyotype cells and the nuclei of the abnormal karyotype cells is often difficult to be directly distinguished by naked eyes, so that the efficiency and the precision of identifying the abnormal karyotype cells in the prior art are extremely low, and the efficiency and the precision of subsequently judging the tumor-related cells are further reduced. Based on this, the embodiment of the application adopts a predetermined feature extraction algorithm to determine the target karyotype associated feature value corresponding to the cell to be identified, and performs automatic identification of the karyotype abnormal cell based on the target karyotype associated feature value corresponding to the cell to be identified.
Specifically, the inventors of the present application found, through research, that features (i.e., karyotype-associated features) in a nuclear microscopic image, which are closely related to nuclear karyotype determination, include four types, respectively: nuclear morphological characteristics, nuclear texture characteristics, color characteristics and edge characteristics; the morphological characteristics of the cell nucleus further comprise: the contour perimeter of the nucleus, the contour area of the nucleus, the contour circularity of the nucleus, the contour elongation of the nucleus, the aspect ratio (i.e. the ratio of the long axis to the short axis of the minimum circumscribed rectangle of the contour of the nucleus), the squareness (i.e. the ratio of the minimum circumscribed rectangle of the contour of the nucleus to the connected domain area), the firmness (i.e. the ratio of the connected domain area of the contour of the nucleus to the convex hull area), the direction degree (i.e. the angle of the ellipse fitted long axis of the contour of the nucleus), the short axis (i.e. the long axis of the ellipse fitted of the contour of the nucleus), and the long axis (i.e. the short axis of the ellipse fitted of the contour of the nucleus); the nuclear texture features further include: total signal intensity (i.e. sum of pixel values in the cell nucleus contour), average signal intensity (i.e. ratio of sum of pixel values in the cell nucleus contour to number of pixel points), signal intensity variance (i.e. variance of pixel values in the cell nucleus contour), inverse difference moment, sharpness, contrast; the color features further include: the background average signal intensity of each fluorescence channel and the background signal intensity variance of each fluorescence channel; the edge feature further comprises: zernike moments of orders 2-9.
For the above four types of features, if a full-scale analysis is performed to identify the abnormal karyotype cells, a huge workload is brought to feature extraction, so that the efficiency of identifying the abnormal karyotype cells is greatly reduced. Specifically, in the embodiment of the application, a candidate karyotype-associated feature set is obtained by screening an initial karyotype-associated feature set (i.e., a karyotype-associated feature set jointly composed of the morphological feature subset, the texture feature subset, the color feature subset and the edge feature subset) based on a ReliefF algorithm and a maximum correlation minimum redundancy mRMR algorithm, a preset number of features with the highest correlation with karyotype abnormal cells can be respectively obtained from each feature subset through the ReliefF algorithm, and then redundant features are removed based on the mRMR algorithm, so that the candidate karyotype-associated feature set can be obtained. It should be noted that, in the actual operation process, the features in a certain feature subset may be completely excluded due to the error execution of the ReliefF algorithm and/or the mRMR algorithm, thereby affecting the accuracy of the identification of the karyotype abnormal cells. Based on this, the candidate karyotype associated feature set obtained by screening from the initial karyotype associated feature set based on the ReliefF algorithm and the mRMR algorithm in the embodiments of the present application must be guaranteed to include at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature, and at least one candidate edge feature.
After the candidate karyotype associated feature set is determined, secondary screening is carried out on the candidate karyotype associated feature set based on an mRMR algorithm and a backward selection method so as to further remove redundant features among feature subsets, and further screening is carried out to obtain a preset number of target karyotype associated features with high correlation with karyotype abnormal cells.
Step 120, inputting the target karyotype association characteristic value corresponding to the cell to be identified into a trained karyotype abnormal cell determination model, and outputting a karyotype abnormal cell identification result corresponding to the target karyotype association characteristic value;
the karyotype abnormal cell determination model is obtained by training a target karyotype correlation characteristic value sample and a predetermined karyotype abnormal cell identification result label.
Specifically, after the target karyotype-associated feature value corresponding to the cell to be recognized is determined, the target karyotype-associated feature value corresponding to the cell to be recognized is input into a trained karyotype abnormal cell determination model, and a karyotype abnormal cell recognition result corresponding to the target karyotype-associated feature value is predicted and output through the karyotype abnormal cell determination model. The karyotype abnormal cell determination model is obtained by training based on a target karyotype correlation characteristic value sample and a predetermined karyotype abnormal cell identification result label. It is understood that the karyotypically abnormal cells are recognized as karyotypically abnormal or karyotypically normal.
Furthermore, after the karyotype abnormal cell identification result is obtained through the karyotype abnormal cell determination model, the embodiment of the application may further determine the proportion of the cells to be identified in the cell microscopic image, which are considered as karyotype abnormal cells, determine the abnormal grade of the cells to be identified based on the proportion of the karyotype abnormal cells in the cell microscopic image, and further determine whether to reacquire the cell microscopic image, that is, determine whether to give up subsequent interpretation operation on the cells to be identified in the current cell microscopic image, reacquire the cell microscopic images of other cells to be identified for subsequent interpretation, and avoid that the efficiency of subsequent tumor-related cell interpretation is affected due to excessive karyotype abnormal cells in the current cell microscopic image.
The method provided by the embodiment of the application determines a target karyotype association feature value corresponding to a cell to be identified based on a cell nucleus microscopic image corresponding to the cell to be identified, wherein the target karyotype association feature is determined by screening a candidate karyotype association feature set from an initial karyotype association feature set based on a Relieff algorithm and a maximum correlation minimum redundancy mRMR algorithm, and performing secondary screening on the candidate karyotype association feature set based on an mRMR algorithm and a backward selection method, the initial karyotype association feature set comprises a morphological feature subset, a texture feature subset, a color feature subset and an edge feature subset, the candidate karyotype association feature set comprises at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature, and the target karyotype association feature value corresponding to the cell to be identified is input into a trained karyotype abnormal cell determination model, and outputting a karyotype abnormal cell identification result corresponding to the target karyotype correlation characteristic value, wherein the karyotype abnormal cell determination model is obtained by training based on a target karyotype correlation characteristic value sample and a predetermined karyotype abnormal cell identification result label, high-precision identification of the karyotype abnormal cell can be realized by using the minimum target karyotype correlation characteristic, and meanwhile, the determination process of the karyotype abnormal cell is automatically completed through the model, so that the efficiency and accuracy of the karyotype abnormal cell identification are ensured, and further the efficiency and accuracy of tumor-related cell interpretation are ensured.
Based on the above embodiment, the step of acquiring the cell nucleus microscopic image corresponding to the cell to be identified includes:
segmenting the cell microscopic image corresponding to the blood sample based on a preset segmentation network to obtain an initial cell nucleus microscopic image corresponding to the blood sample;
determining candidate cell nucleuses with over-segmentation based on the position relation of each cell nucleus and the number of fluorescent signal points in the cell nucleus in initial cell nucleus microscopic images corresponding to different fluorescent channels of the blood sample, and correcting the candidate cell nuclei based on a convex hull detection algorithm and a convex defect algorithm to obtain cell nucleus corrected microscopic images corresponding to the blood sample;
and taking the cell nucleus correction microscopic image corresponding to the blood sample as the cell nucleus microscopic image corresponding to the cell to be identified.
Specifically, the conventional cell nucleus segmentation method is to segment the cell nucleus in the cell microscopic image based on a deep neural network (e.g., Mask-RCNN, etc.), and this method can accurately segment the cell nucleus of the granulocytic cell. However, the inventor of the present application finds, through research, that since the nucleus of the granulocyte is usually petaloid, the nucleus of the granulocyte is easily segmented into a plurality of different nuclei (i.e., over-segmentation) by using the conventional manner of performing the nucleus segmentation based on the deep neural network, and a nucleus segmentation error will seriously affect the identification of the subsequent abnormal karyotype cells, which results in a reduction in the identification accuracy of the abnormal karyotype cells and a statistical error in the number of cells, and further results in a decision error as to whether to acquire a cell microscopic image again, which affects the efficiency and accuracy of the subsequent tumor-related cell interpretation. Based on this, after segmenting the cell microscopic image corresponding to the blood sample, the embodiment of the present application further determines and corrects the over-segmented cell nucleus, which includes the following specific steps:
firstly, a cell microscopic image corresponding to the blood sample is segmented based on a predetermined segmentation network (i.e. the aforementioned deep neural network) to obtain an initial nucleus microscopic image corresponding to the blood sample. It is understood that the initial nuclear microscopy images correspond to the four different fluorescence channels described above.
After the initial cell nucleus microscopic image corresponding to the blood sample is obtained, determining candidate cell nuclei with over-segmentation based on the position relation of each cell nucleus and the number of fluorescence signal points in the cell nuclei in the initial cell nucleus microscopic image corresponding to different fluorescence channels of the blood sample. It should be noted that if there is an over-segmentation situation in the nuclei of a certain target granulocyte, the nuclei of the corresponding target granulocyte in the initial nucleus microscopic images corresponding to different fluorescence channels of the blood sample will be over-segmented (for example, the nuclei of the target granulocyte are segmented into two sub-nuclei, and the nuclei of the target granulocyte in the initial nucleus microscopic images corresponding to four fluorescence channels will be segmented into two sub-nuclei). In addition, since the over-segmentation is to segment one target cell nucleus into a plurality of cell nuclei, the segmented cell nuclei are adjacent to each other. It is understood that the positional adjacency may be either a direct adjacency (i.e., the two cell nucleus outlines are in direct contact) or an indirect adjacency (i.e., the multiple cell nuclei are in turn in direct adjacency, wherein the cell nuclei that are not in direct contact with the two outlines are referred to as indirect adjacency). Meanwhile, based on the principle of fluorescence in situ hybridization technology, it is known that for each fluorescence channel, two fluorescence signal points should be included in the same cell nucleus. Based on the principle, if multiple adjacent cell nuclei exist in the same area in the initial cell nucleus microscopic images corresponding to different fluorescence channels of the blood sample, and the sum of the number of fluorescence signal points in the multiple adjacent cell nuclei is 2, the multiple adjacent cell nuclei are over-segmented cell nuclei, and at the moment, the correction can be completed only by merging the multiple adjacent cell nuclei. This judgment method is applicable to the case of over-segmentation of single granulocytes.
However, since there may be cell aggregation during the preparation of the blood sample, there may be a case where the anucleated cells and the nuclei of granulocytes are adjacent in the initial cell nucleus microscopic image corresponding to the blood sample, and at this time, the above-mentioned method for determining the nucleus that is over-segmented cannot accurately determine the target nucleus that is over-segmented, such as the adjacent nucleus A, B, C, where the nucleus a is the nucleus of the anucleated cells, and the nuclei B and C are two nuclei obtained by over-segmentation of granulocytes, it can be understood that there are two fluorescence signal points in the nucleus a, and it is assumed that there are two fluorescence signal points in the nucleus B and no fluorescence signal point in the nucleus C. At this time, it is not possible to determine whether the over-divided cell nuclei are a and C or B and C based on the above determination manner. Meanwhile, in the initial nucleus microscopic image corresponding to the blood sample, there may be a case where nuclei of two granulocytes are adjacent, and at this time, the above-mentioned method for determining the nucleus over-segmented cannot accurately determine the nucleus over-segmented, such as adjacent nucleus D, E, F, G, where nuclei D and E are two nuclei over-segmented by granulocyte M, nuclei F and G are two nuclei over-segmented by granulocyte N, and it is assumed that there are two fluorescence signal points in both nuclei D and F and there is no fluorescence signal point in nuclei E and G. At this time, it is not possible to determine whether the over-divided cell nuclei are combined into DE and FG or DG and FE based on the above-described determination method. If the cell judgment of over-segmentation is performed and the cell judgment is combined only based on the sum of the number of fluorescent signal points in the adjacent cell nuclei being 2 at this time, the cell nucleus segmentation error will be further caused. Based on this, the embodiment of the present application first determines that there is an over-segmented candidate nucleus, i.e., the nuclei a-C or D-G with adjacent positions in the foregoing example, based on the position relationship of each nucleus and the number of fluorescence signal points in the nucleus in the initial nucleus microscopic image corresponding to different fluorescence channels of the blood sample, and it can be understood from the foregoing example that if there are a plurality of adjacent nuclei in the same area in the initial nucleus microscopic image corresponding to different fluorescence channels of the blood sample, and the sum of the numbers of fluorescence signal points of at least two nuclei in the plurality of adjacent nuclei is 2, then the plurality of adjacent nuclei will be determined as there is an over-segmented candidate nucleus. After the candidate cell nucleus is determined, the candidate cell nucleus is corrected based on a convex hull detection algorithm and a convex defect algorithm, so that the accuracy of cell nucleus correction can be ensured.
And after correcting all the candidate cell nucleuses with excessive segmentation, obtaining a cell nucleus correction microscopic image corresponding to the blood sample, and taking the cell nucleus correction microscopic image as a cell nucleus microscopic image corresponding to the cell to be identified. It is understood that in the cell nucleus corrected microscopic image, the over-segmented cell nuclei have been corrected, i.e. the cell nuclei in the cell nucleus corrected microscopic image can be identified as accurately segmented cell nuclei. Based on the method, the accuracy of the identification result of the subsequent karyotype abnormal cells can be ensured to the maximum extent.
According to the method provided by the embodiment of the application, the cell microscopic image corresponding to the blood sample is segmented based on the preset segmentation network so as to obtain the initial cell nucleus microscopic image corresponding to the blood sample; determining over-segmented cell nuclei and correcting the over-segmented cell nuclei to obtain cell nucleus corrected microscopic images corresponding to the blood sample based on the position relation of each cell nucleus and the number of fluorescent signal points in the cell nuclei in initial cell nucleus microscopic images corresponding to different fluorescent channels of the blood sample; and taking the cell nucleus corrected microscopic image corresponding to the blood sample as the cell nucleus microscopic image corresponding to the cell to be identified, so that the accuracy of cell nucleus segmentation can be ensured, and the accuracy of a subsequent karyotype abnormal cell identification result can be further ensured.
Based on the above embodiment, the step of correcting the candidate cell nucleus based on the convex hull detection algorithm and the convex defect algorithm specifically includes:
determining a cell nucleus set formed by target cell nuclei of which the sum of the number of fluorescence signal points is a preset value in the candidate cell nuclei;
merging the target cell nucleuses in the cell nucleus set to obtain a corresponding corrected cell nucleus;
determining a contour point set which is farthest from a convex hull on the contour of the modified cell nucleus based on a convex hull detection algorithm and a convex defect algorithm, and determining a target contour point in the contour point set based on a preset distance threshold, wherein the target contour point is a contour point in the contour point set, the distance between the contour point set and the convex hull of which exceeds the preset distance threshold;
determining whether the modified cell nuclei are correct based on the number of the target contour points, and deleting incorrect modified cell nuclei.
Specifically, as can be seen from the foregoing embodiment, the preset value is 2, and it can be understood that, since the sum of the number of fluorescence signal points of a plurality of groups of target cell nuclei may be 2, the set of cell nuclei may be one or more. For one case (i.e., corresponding to the case of over-segmentation of single granulocytes), merging the target nuclei in the set of nuclei to obtain corresponding modified nuclei is sufficient. However, in the case of a plurality of cell nuclei, it can be known from the foregoing embodiment that it is not possible to accurately determine which cell nucleus set corresponds to the target cell nucleus that is an over-segmented cell nucleus, and therefore, it is necessary to further identify the merged modified cell nucleus (in this case, a plurality of modified cell nuclei are generated), so as to ensure the accuracy of the cell nucleus modification. However, the inventors of the present application found, through research, that due to the specificity of the shape of the nuclei of the granulocytes, there is a significant difference between the contours of the correctly merged corrected nuclei (corresponding to the case where a plurality of nuclei of the same granulocytes over-segmented merge) and the incorrectly merged corrected nuclei (corresponding to the case where the nuclei of the granulocytes over-segmented merge with the nuclei of the agranulocytes and the case where a plurality of nuclei of the granulocytes over-segmented merge across), and the difference can be determined by the set of contour points on the contour of the corrected nuclei that are farthest from the convex hull, that is, if the number of target contour points on the contour of the corrected nuclei that are farthest from the convex hull exceeds the preset distance threshold exceeds the preset number threshold, the corrected nuclei can be determined to be correct. The preset distance threshold is preferably 900 pixels, the preset number threshold is preferably 4, and the preset distance threshold and the preset number threshold may be reasonably adjusted according to an actual application scenario, which is not specifically limited in the embodiment of the present application.
Based on the above, in the embodiment of the application, a contour point set which is farthest from a convex hull on the contour of each corrected cell nucleus is determined based on a convex hull detection algorithm and a convex defect algorithm, target contour points in the contour point set are determined based on a preset distance threshold, whether the corrected cell nucleus is correct or not can be determined based on the number of the target contour points, and a cell nucleus corrected microscopic image corresponding to the blood sample can be obtained by deleting incorrect corrected cell nuclei. It is understood that the deleting of the incorrectly corrected cell nucleus refers to deleting a corresponding cell nucleus contour in the cell nucleus microscopic image, which may be implemented by a corresponding algorithm, and the implementation manner of the algorithm is not specifically limited in the embodiments of the present application. As for the convex hull detection algorithm and the convex defect algorithm, which are conventional algorithms in the art, the detailed operation steps are not described herein.
It should be further understood that any existing contour merging algorithm may be used to implement the merging and correcting process, and this is not limited in this embodiment of the present application.
According to the method provided by the embodiment of the application, a cell nucleus set formed by target cell nuclei is determined, wherein the sum of the number of fluorescence signal points in the candidate cell nuclei is a preset value; merging the target cell nucleuses in the cell nucleus set to obtain a corresponding corrected cell nucleus; determining a contour point set which is farthest from a convex hull on the contour of the modified cell nucleus based on a convex hull detection algorithm and a convex defect algorithm, and determining a target contour point in the contour point set based on a preset distance threshold, wherein the target contour point is a contour point in the contour point set, the distance between the contour point set and the convex hull of which exceeds the preset distance threshold; determining whether the modified cell nuclei are correct based on the number of the target contour points, and deleting incorrect modified cell nuclei. The method can accurately correct the excessively segmented cell nucleus under the condition of cell aggregation, further ensure the accuracy of cell nucleus segmentation, and further ensure the accuracy of a subsequent karyotype abnormal cell identification result.
Based on the above embodiment, the screening of the initial karyotype correlation feature set based on the ReliefF algorithm and the maximum correlation minimum redundancy mRMR algorithm to obtain the candidate karyotype correlation feature set specifically includes:
respectively performing feature screening on each feature subset in the initial karyotype associated feature set based on a Relieff algorithm and an mrMR algorithm to obtain at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature;
and combining the candidate morphological characteristics, the candidate texture characteristics, the candidate color characteristics and the candidate edge characteristics to obtain the candidate karyotype association characteristic set.
Specifically, as can be seen from the foregoing embodiments, in order to avoid that all features in a certain feature subset are excluded due to errors in the implementation of the ReliefF algorithm and/or the mRMR algorithm, thereby affecting the accuracy of karyotypic abnormal cell identification, it is required to ensure that at least one candidate karyotype-associated feature is retained in each feature subset after screening based on the ReliefF algorithm and the mRMR algorithm. Based on this, in the embodiment of the application, feature screening is performed on each feature subset in the initial karyotype-associated feature set based on a ReliefF algorithm and an mRMR algorithm, so as to obtain at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature, and at least one candidate edge feature. Meanwhile, each feature subset is respectively screened, so that the candidate karyotype correlation features corresponding to each feature subset can be guaranteed to be the features with the largest correlation degree and the smallest redundancy with the karyotype abnormal cells, and the efficiency and the accuracy of subsequent karyotype abnormal cell identification are guaranteed.
The method provided by the embodiment of the application, the screening of the initial karyotype correlation feature set based on the ReliefF algorithm and the maximum correlation minimum redundancy mRMR algorithm to obtain the candidate karyotype correlation feature set, specifically includes: respectively performing feature screening on each feature subset in the initial karyotype associated feature set based on a Relieff algorithm and an mRMR algorithm to obtain at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature, and combining the candidate morphological feature, the candidate texture feature, the candidate color feature and the candidate edge feature to obtain the candidate karyotype associated feature set, so that the efficiency of feature screening is ensured, and the efficiency and the accuracy of subsequent karyotype abnormal cell identification are ensured.
Based on any of the above embodiments, the performing feature screening on each feature subset in the initial karyotype-associated feature set based on the ReliefF algorithm and the mRMR algorithm specifically includes:
determining a target feature subset, and acquiring a feature value corresponding to each sample in a training sample set and the target feature subset;
determining the weight corresponding to each feature in the target feature subset through a Relieff algorithm based on the feature value corresponding to each sample in the training sample set and the target feature subset;
screening the features in the target feature subset based on the weight threshold corresponding to the target feature subset, and determining a first feature combination corresponding to the target feature subset;
and determining first average mutual information between each feature and the class C in the first feature combination and second average mutual information between each feature based on an mRMR algorithm, and screening the features in the first feature combination based on the first average mutual information and the second average mutual information to obtain a second feature combination corresponding to the target feature subset.
Specifically, a training sample set is preset in the embodiment of the present application, where the training sample set includes a first preset number of karyotype normal cell samples and a second preset number of karyotype abnormal cell samples. After a target feature subset is determined, feature values corresponding to samples in the training sample set and the target feature subset are obtained, weights corresponding to features in the target feature subset are determined through a Relieff algorithm based on the feature values corresponding to the samples in the training sample set and the target feature subset, specifically, the sampling frequency m is determined in advance, and the following steps are executed in a circulating mode until the sampling frequency m is reached: randomly taking a sample R from a training sample set, finding k adjacent samples of the R from a sample set which is the same as the sample R (assuming that the R is a karyotype abnormal cell sample, the sample set which is the same is the karyotype abnormal cell sample), finding k adjacent samples from a sample set which is different from the sample R (assuming that the R is the karyotype abnormal cell sample, the sample set which is different from the karyotype normal cell sample), and updating the weight of each feature through a feature weight calculation formula of a Relieff algorithm based on a feature value corresponding to the same feature of the sample R and the adjacent samples. As for the characteristic weight calculation formula of the ReliefF algorithm, it is well known in the art and will not be described herein.
After determining the weight corresponding to each feature in the target feature subset, screening the features in the target feature subset based on the weight threshold corresponding to the target feature subset, and determining a first feature combination corresponding to the target feature subset. It is to be understood that the features in the first combination of features are features having a weight greater than the weight threshold. It should be noted that, normally, in consideration of ensuring that the screening criteria are unified, the weight threshold corresponding to each feature subset should be the same, but based on the foregoing embodiments, since the ReliefF algorithm may be executed with an error, so that the weight of each feature in the target feature subset is smaller than the weight threshold, in view of ensuring the accuracy of identifying abnormal karyotype cells, in the embodiment of the present application, when the weight of each feature in the target feature subset is smaller than the weight threshold, the weight threshold corresponding to the target feature subset is appropriately reduced, so as to ensure that at least one feature is included in the first feature combination. It can be understood that, since the weights of all the features in the target feature subset are updated every time sampling is performed, even if the ReliefF algorithm is performed with errors, the weights of the features are biased along the same trend (i.e., biased to be larger or smaller), and based on this, even if the weight threshold is adjusted, the features with low correlation are not introduced, so that the accuracy of feature screening is ensured.
After the first feature combination corresponding to the target feature subset is determined, in the embodiment of the present application, first average mutual information between each feature in the first feature combination and the class C (i.e., karyotype abnormal cells) and second average mutual information between each feature are determined based on an mRMR algorithm, and features in the first feature combination are screened based on the first average mutual information and the second average mutual information to obtain a second feature combination corresponding to the target feature subset, so that redundant features in the target feature subset can be deleted based on the second feature combination. As for the calculation formulas of the first average mutual information and the second average mutual information, they are well known in the art and will not be described herein again. It is understood that when there is only one feature in the first feature combination, it will be retained as a second feature combination corresponding to the target feature subset, based on which, it can be ensured that at least one feature is included in the second feature combination, thereby ensuring the accuracy of subsequent karyotype abnormal cell identification.
In the method provided by the embodiment of the present application, the performing feature screening on each feature subset in the initial karyotype-associated feature set based on the ReliefF algorithm and the mRMR algorithm specifically includes: determining a target feature subset, acquiring a feature value corresponding to each sample in a training sample set and the target feature subset, and based on the feature value corresponding to each sample in the training sample set and the target feature subset, determining the weight corresponding to each feature in the target feature subset through a Relieff algorithm, screening the features in the target feature subset based on a weight threshold corresponding to the target feature subset, determining a first feature combination corresponding to the target feature subset, determining first average mutual information between each feature and the class C in the first feature combination and second average mutual information between each feature based on an mRMR algorithm, and screening the features in the first feature combination based on the first average mutual information and the second average mutual information to obtain a second feature combination corresponding to the target feature subset, so that the efficiency and the accuracy of subsequent karyotype abnormal cell identification can be ensured.
Based on any of the above embodiments, the feature in the second feature combination is a feature in the first feature combination that maximizes a difference or quotient between the first average mutual information and the second average mutual information.
Specifically, based on the principle of the mRMR algorithm, the feature with the maximum difference or quotient between the first average mutual information and the second average mutual information is the feature with the maximum correlation with the category C and the minimum redundancy, so that the efficiency and accuracy of subsequent karyotype abnormal cell identification can be ensured.
In the method provided by the embodiment of the present application, the feature in the second feature combination is a feature in the first feature combination that maximizes the difference or quotient between the first average mutual information and the second average mutual information, which can ensure that the correlation between the feature in the second feature combination and the abnormal karyotype cell is maximized and the redundancy is minimized, thereby ensuring the efficiency and accuracy of subsequent karyotype abnormal cell identification.
Based on any of the above embodiments, the secondary screening of the candidate karyotype-associated feature set based on the mRMR algorithm and the backward selection method specifically includes:
re-screening the features in the candidate karyotype associated feature set based on an mRMR algorithm to determine a potential karyotype associated feature set;
and selecting the features in the potential karyotype association feature set based on a backward selection method, and determining the target karyotype association feature.
Specifically, based on the foregoing embodiments, in the candidate karyotype-associated feature set obtained by screening from the initial karyotype-associated feature set based on the ReliefF algorithm and the mRMR algorithm, the candidate feature (i.e., the second feature combination) corresponding to each feature subset is the feature with the largest correlation with the karyotype abnormal cells and the smallest redundancy in the corresponding feature subset. However, in practical applications, the inventors of the present application find that feature redundancy may also exist among feature subsets, and based on this, in the embodiments of the present application, after a candidate karyotype-associated feature set (i.e., a set formed by second feature combinations corresponding to the feature subsets) is determined, features in the candidate karyotype-associated feature set may be re-screened based on an mRMR algorithm to eliminate redundant features among different second feature combinations, so as to obtain a potential karyotype-associated feature set.
After the potential karyotype association feature set is obtained, features in the potential karyotype association feature set are selected based on a backward selection method, and the target karyotype association feature is determined, so that the efficiency and accuracy of identification of the karyotype abnormal cells can be guaranteed to the maximum extent through the target karyotype association feature. It is understood that the forward selection method may also be used to screen the set of potential karyotype association features, which is the same as the backward selection method.
In the method provided by the embodiment of the present application, the secondary screening of the candidate karyotype correlation feature set based on the mRMR algorithm and the backward selection method specifically includes: and re-screening the features in the candidate karyotype associated feature set based on an mRMR algorithm to determine a potential karyotype associated feature set, selecting the features in the potential karyotype associated feature set based on a backward selection method to determine the target karyotype associated feature, and further screening the karyotype associated feature to ensure the identification efficiency and accuracy of the karyotype abnormal cells to the maximum extent.
Based on any one of the embodiments above, the determining, by a ReliefF algorithm, a weight corresponding to each feature in the target feature subset based on the feature value corresponding to each sample in the training sample set and the target feature subset includes:
determining the difference between the feature values corresponding to the target sample and the similar nearest neighbor samples with the preset number and the difference between the feature values corresponding to the target sample and the different nearest neighbor samples with the preset number based on the feature values corresponding to the samples in the training sample set and the target feature subset;
and updating the weight corresponding to each feature in the target feature subset based on the difference between the feature values corresponding to the target sample and the similar nearest neighbor samples in the preset number and the difference between the feature values corresponding to the target sample and the different nearest neighbor samples in the preset number until the updating frequency reaches a preset threshold value.
Specifically, based on the foregoing embodiment, after determining the feature value corresponding to each sample in the training sample set and the target feature subset, the weight corresponding to each feature in the target feature subset may be calculated, specifically, based on the feature value corresponding to each sample in the training sample set and the target feature subset, the difference between the feature value corresponding to the target sample and the predetermined number (i.e., k in the foregoing embodiment) of the same-type nearest neighbor samples and the difference between the feature value corresponding to the target sample and the predetermined number of different-type nearest neighbor samples are determined, and based on the difference between the feature value corresponding to the target sample and the predetermined number of same-type nearest neighbor samples and the difference between the feature value corresponding to the target sample and the predetermined number of different-type nearest neighbor samples, the weight corresponding to each feature in the target feature subset is updated until the update number reaches the predetermined threshold (i.e., the foregoing number m), the weights of the features in the target feature subset can be determined quickly. As for the specific updating process of the weights, the foregoing embodiments have been described, and are not described herein again. It can be understood that values of k and m may be set according to actual needs, and this is not specifically limited in the embodiment of the present application.
The method provided by the embodiment of the present application, determining, by a ReliefF algorithm, weights corresponding to features in the target feature subset based on feature values corresponding to the samples in the training sample set and the target feature subset, includes: based on the characteristic values corresponding to the samples in the training sample set and the target characteristic subset, determining the difference between the characteristic values corresponding to the target sample and the similar nearest neighbor samples with the preset number, and the difference between the characteristic values corresponding to the target sample and the different nearest neighbor samples with the preset number, and updating the weight corresponding to each feature in the target characteristic subset until the updating frequency reaches a preset threshold value, so that the weight of each feature in each target characteristic subset can be rapidly determined, and the efficiency and the accuracy of feature screening are ensured.
Based on any of the above embodiments, the target karyotype association characteristics include: the cell nucleus perimeter, the cell nucleus area, the cell nucleus circularity, the cell nucleus elongation, the cell nucleus aspect ratio, the cell nucleus rectangularity, the cell nucleus firmness, the cell nucleus direction degree, the cell nucleus long axis, the cell nucleus short axis, the cell nucleus microscopic image sharpness, the cell nucleus microscopic image total signal intensity, the cell nucleus microscopic image average signal intensity, the cell nucleus microscopic image signal intensity variance and the cell nucleus microscopic image corresponding Zernike moment.
Specifically, in the embodiment of the present application, a candidate karyotype associated feature set is obtained by screening from an initial karyotype associated feature set based on a ReliefF algorithm and an mRMR algorithm, and a target karyotype associated feature determined after performing secondary screening on the candidate karyotype associated feature set based on the mRMR algorithm and a backward selection method includes: the cell nucleus perimeter, the cell nucleus area, the cell nucleus circularity, the cell nucleus elongation, the cell nucleus aspect ratio, the cell nucleus rectangularity, the cell nucleus firmness, the cell nucleus orientation degree, the cell nucleus long axis, the cell nucleus short axis, the cell nucleus microscopic image sharpness, the cell nucleus microscopic image total signal intensity, the cell nucleus microscopic image average signal intensity, the cell nucleus microscopic image signal intensity variance and the cell nucleus microscopic image corresponding Zernike moment. It can be understood that the Zernike moments corresponding to the nuclear microscopic images, namely the aforementioned 2-9 order Zernike moments, can be considered based on the efficiency and accuracy of abnormal karyotype cell identification. Meanwhile, it can be seen that the finally obtained target karyotype associated features do not include color features, because the features in the color feature subset are redundant with the other three feature subsets, and therefore, the features in the color feature subset are removed. When the candidate karyotype associated feature set is obtained by screening from the initial karyotype associated feature set based on the Relieff algorithm and the mRMR algorithm, as the feature subsets are respectively screened, the comprehensiveness of the features in the candidate karyotype associated feature set is ensured, the error deletion of useful features is avoided, the candidate karyotype associated feature set is secondarily screened by the mRMR algorithm and a backward selection method, redundant features among the feature subsets are eliminated, the identification accuracy of karyotype abnormal cells is ensured, and the identification efficiency is also considered.
In the method provided by the embodiment of the present application, the target karyotype correlation characteristic includes: the cell nucleus perimeter, the cell nucleus area, the cell nucleus circularity, the cell nucleus elongation, the cell nucleus width-to-height ratio, the cell nucleus rectangularity, the cell nucleus firmness, the cell nucleus direction degree, the cell nucleus long axis, the cell nucleus short axis, the cell nucleus microscopic image sharpness, the cell nucleus microscopic image total signal intensity, the cell nucleus microscopic image average signal intensity, the cell nucleus microscopic image signal intensity variance and the cell nucleus microscopic image corresponding Zernike moment, so that the identification accuracy of the nuclear abnormal cells is guaranteed, and the identification efficiency is also considered.
The device for determining abnormal karyotype cells provided by the present application is described below, and the device for determining abnormal karyotype cells described below and the method for determining abnormal karyotype cells described above may be referred to in correspondence with each other.
Based on any of the above embodiments, fig. 4 is a schematic structural diagram of the device for determining a karyotypic abnormal cell provided by the present application, as shown in fig. 4, the device includes:
the target karyotype association characteristic value determining module 410 is configured to determine a target karyotype association characteristic value corresponding to a cell to be identified based on a cell nucleus microscopic image corresponding to the cell to be identified; the target karyotype correlation characteristics are determined after screening a candidate karyotype correlation characteristic set from an initial karyotype correlation characteristic set based on a Relieff algorithm and a maximum correlation minimum redundancy (mRMR) algorithm, and performing secondary screening on the candidate karyotype correlation characteristic set based on an mRMR algorithm and a backward selection method; the initial karyotype associated feature set comprises a morphological feature subset, a texture feature subset, a color feature subset and an edge feature subset, and the candidate karyotype associated feature set comprises at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature;
a karyotype abnormal cell determining module 420, configured to input the target karyotype associated characteristic value corresponding to the cell to be identified into the trained karyotype abnormal cell determining model, and output a karyotype abnormal cell identification result corresponding to the target karyotype associated characteristic value;
the karyotype abnormal cell determination model is obtained by training a target karyotype correlation characteristic value sample and a predetermined karyotype abnormal cell identification result label.
The device provided by the embodiment of the application determines a target karyotype associated feature value corresponding to a cell to be identified based on a cell nucleus microscopic image corresponding to the cell to be identified, wherein the target karyotype associated feature is determined by screening a candidate karyotype associated feature set from an initial karyotype associated feature set based on a Relieff algorithm and a maximum correlation minimum redundancy (mRMR) algorithm, and then secondarily screening the candidate karyotype associated feature set based on the mRMR algorithm and a backward selection method, the initial karyotype associated feature set comprises a morphological feature subset, a texture feature subset, a color feature subset and an edge feature subset, the candidate karyotype associated feature set comprises at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature, and the target karyotype associated feature value corresponding to the cell to be identified is input into a trained karyotype abnormal cell determination model, and outputting a karyotype abnormal cell identification result corresponding to the target karyotype correlation characteristic value, wherein the karyotype abnormal cell determination model is obtained by training based on a target karyotype correlation characteristic value sample and a predetermined karyotype abnormal cell identification result label, high-precision identification of the karyotype abnormal cell can be realized by using the minimum target karyotype correlation characteristic, and meanwhile, the determination process of the karyotype abnormal cell is automatically completed through the model, so that the efficiency and accuracy of the karyotype abnormal cell identification are ensured, and further the efficiency and accuracy of tumor-related cell interpretation are ensured.
Based on the above embodiment, the apparatus further includes a candidate karyotype associated feature set determination module, where the candidate karyotype associated feature set determination module is configured to perform the following steps:
respectively performing feature screening on each feature subset in the initial karyotype associated feature set based on a Relieff algorithm and an mrMR algorithm to obtain at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature;
and combining the candidate morphological characteristics, the candidate texture characteristics, the candidate color characteristics and the candidate edge characteristics to obtain the candidate karyotype association characteristic set.
Based on any of the above embodiments, the performing feature screening on each feature subset in the initial karyotype associated feature set based on the ReliefF algorithm and the mRMR algorithm specifically includes:
determining a target feature subset, and acquiring a feature value corresponding to each sample in a training sample set and the target feature subset;
determining the weight corresponding to each feature in the target feature subset through a Relieff algorithm based on the feature value corresponding to each sample in the training sample set and the target feature subset;
screening the features in the target feature subset based on the weight threshold corresponding to the target feature subset, and determining a first feature combination corresponding to the target feature subset;
and determining first average mutual information between each feature and the class C in the first feature combination and second average mutual information between each feature based on an mRMR algorithm, and screening the features in the first feature combination based on the first average mutual information and the second average mutual information to obtain a second feature combination corresponding to the target feature subset.
Based on any of the above embodiments, the feature in the second feature combination is a feature in the first feature combination that maximizes a difference or quotient between the first average mutual information and the second average mutual information.
Based on any of the above embodiments, the apparatus further comprises a secondary screening module, the secondary screening module is configured to perform the following steps:
re-screening the features in the candidate karyotype associated feature set based on an mRMR algorithm to determine a potential karyotype associated feature set;
and selecting the features in the potential karyotype association feature set based on a backward selection method, and determining the target karyotype association features.
Based on any one of the embodiments above, the determining, by a ReliefF algorithm, a weight corresponding to each feature in the target feature subset based on the feature value corresponding to each sample in the training sample set and the target feature subset includes:
determining the difference between the characteristic values corresponding to the target sample and the similar nearest neighbor samples with the preset number and the difference between the characteristic values corresponding to the target sample and the different nearest neighbor samples with the preset number based on the characteristic values corresponding to the samples in the training sample set and the target characteristic subset;
and updating the weight corresponding to each feature in the target feature subset based on the difference between the feature values corresponding to the target sample and the similar nearest neighbor samples in the preset number and the difference between the feature values corresponding to the target sample and the different nearest neighbor samples in the preset number until the updating frequency reaches a preset threshold value.
Based on any of the above embodiments, the target karyotype correlation features include: the cell nucleus perimeter, the cell nucleus area, the cell nucleus circularity, the cell nucleus elongation, the cell nucleus aspect ratio, the cell nucleus rectangularity, the cell nucleus firmness, the cell nucleus direction degree, the cell nucleus long axis, the cell nucleus short axis, the cell nucleus microscopic image sharpness, the cell nucleus microscopic image total signal intensity, the cell nucleus microscopic image average signal intensity, the cell nucleus microscopic image signal intensity variance and the cell nucleus microscopic image corresponding Zernike moment.
Fig. 5 illustrates a physical structure diagram of an electronic device, which may include, as shown in fig. 5: a processor (processor)510, a communication Interface (Communications Interface)520, a memory (memory)530 and a communication bus 540, wherein the processor 510, the communication Interface 520 and the memory 530 communicate with each other via the communication bus 540. Processor 510 may call logic instructions in memory 530 to perform the method for determining a karyotypically abnormal cell provided by the above methods, the method comprising: determining a target karyotype association characteristic value corresponding to the cell to be identified based on a cell nucleus microscopic image corresponding to the cell to be identified; the target karyotype association feature is determined after a candidate karyotype association feature set is obtained by screening from an initial karyotype association feature set based on a Relieff algorithm and a maximum correlation minimum redundancy (mRMR) algorithm, and then secondary screening is carried out on the candidate karyotype association feature set based on an mRMR algorithm and a backward selection method; the initial karyotype associated feature set comprises a morphological feature subset, a texture feature subset, a color feature subset and an edge feature subset, and the candidate karyotype associated feature set comprises at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature; inputting the target karyotype association characteristic value corresponding to the cell to be identified into a trained karyotype abnormal cell determination model, and outputting a karyotype abnormal cell identification result corresponding to the target karyotype association characteristic value; the karyotype abnormal cell determination model is obtained by training a target karyotype association characteristic value sample and a predetermined karyotype abnormal cell identification result label.
Furthermore, the logic instructions in the memory 530 may be implemented in the form of software functional units and stored in a computer readable storage medium when the software functional units are sold or used as independent products. Based on such understanding, the technical solution of the present application or portions thereof that substantially contribute to the prior art may be embodied in the form of a software product stored in a storage medium and including instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present application. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
In another aspect, the present application further provides a computer program product, the computer program product includes a computer program, the computer program can be stored on a non-transitory computer readable storage medium, when the computer program is executed by a processor, a computer can execute the method for determining karyotype abnormal cells provided by the above methods, the method includes: determining a target karyotype association characteristic value corresponding to the cell to be identified based on the cell nucleus microscopic image corresponding to the cell to be identified; the target karyotype correlation characteristics are determined after screening a candidate karyotype correlation characteristic set from an initial karyotype correlation characteristic set based on a Relieff algorithm and a maximum correlation minimum redundancy (mRMR) algorithm, and performing secondary screening on the candidate karyotype correlation characteristic set based on an mRMR algorithm and a backward selection method; the initial karyotype associated feature set comprises a morphological feature subset, a texture feature subset, a color feature subset and an edge feature subset, and the candidate karyotype associated feature set comprises at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature; inputting the target karyotype association characteristic value corresponding to the cell to be identified into a trained karyotype abnormal cell determination model, and outputting a karyotype abnormal cell identification result corresponding to the target karyotype association characteristic value; the karyotype abnormal cell determination model is obtained by training a target karyotype correlation characteristic value sample and a predetermined karyotype abnormal cell identification result label.
In yet another aspect, the present application also provides a non-transitory computer-readable storage medium having stored thereon a computer program, which when executed by a processor, implements a method for determining karyotype abnormal cells provided by the above methods, the method comprising: determining a target karyotype association characteristic value corresponding to the cell to be identified based on a cell nucleus microscopic image corresponding to the cell to be identified; the target karyotype association feature is determined after a candidate karyotype association feature set is obtained by screening from an initial karyotype association feature set based on a Relieff algorithm and a maximum correlation minimum redundancy (mRMR) algorithm, and then secondary screening is carried out on the candidate karyotype association feature set based on an mRMR algorithm and a backward selection method; the initial karyotype associated feature set comprises a morphological feature subset, a texture feature subset, a color feature subset and an edge feature subset, and the candidate karyotype associated feature set comprises at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature; inputting the target karyotype association characteristic value corresponding to the cell to be identified into a trained karyotype abnormal cell determination model, and outputting a karyotype abnormal cell identification result corresponding to the target karyotype association characteristic value; the karyotype abnormal cell determination model is obtained by training a target karyotype correlation characteristic value sample and a predetermined karyotype abnormal cell identification result label.
The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.
Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in the embodiments or some parts of the embodiments.
Finally, it should be noted that: the above embodiments are only used to illustrate the technical solutions of the present application, and not to limit the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present application.

Claims (12)

1. A method for determining karyotypically abnormal cells, comprising:
determining a target karyotype association characteristic value corresponding to the cell to be identified based on a cell nucleus microscopic image corresponding to the cell to be identified; the target karyotype association feature is determined after a candidate karyotype association feature set is obtained by screening from an initial karyotype association feature set based on a Relieff algorithm and a maximum correlation minimum redundancy (mRMR) algorithm, and then secondary screening is carried out on the candidate karyotype association feature set based on an mRMR algorithm and a backward selection method; the initial karyotype associated feature set comprises a morphological feature subset, a texture feature subset, a color feature subset and an edge feature subset, and the candidate karyotype associated feature set comprises at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature;
inputting the target karyotype association characteristic value corresponding to the cell to be identified into a trained karyotype abnormal cell determination model, and outputting a karyotype abnormal cell identification result corresponding to the target karyotype association characteristic value;
the karyotype abnormal cell determination model is obtained by training a target karyotype correlation characteristic value sample and a predetermined karyotype abnormal cell identification result label.
2. The method for determining karyotypically abnormal cells as claimed in claim 1, wherein said step of obtaining a microscopic image of the nucleus corresponding to the cell to be identified comprises:
segmenting the cell microscopic image corresponding to the blood sample based on a preset segmentation network to obtain an initial cell nucleus microscopic image corresponding to the blood sample;
determining candidate cell nucleuses with over segmentation based on the position relation of each cell nucleus and the number of fluorescence signal points in the cell nucleus in initial cell nucleus microscopic images corresponding to different fluorescence channels of the blood sample, and correcting the candidate cell nucleuses based on a convex hull detection algorithm and a convex defect algorithm to obtain cell nucleus corrected microscopic images corresponding to the blood sample;
and taking the cell nucleus correction microscopic image corresponding to the blood sample as the cell nucleus microscopic image corresponding to the cell to be identified.
3. The method for determining karyotypically abnormal cells as claimed in claim 2, wherein said step of modifying said candidate cell nuclei based on a convex hull detection algorithm and a convex defect algorithm comprises:
determining a cell nucleus set formed by target cell nuclei of which the sum of the number of the fluorescence signal points is a preset value in the candidate cell nuclei;
merging the target cell nucleuses in the cell nucleus set to obtain a corresponding corrected cell nucleus;
determining a contour point set which is farthest away from a convex hull on the contour of the corrected cell nucleus based on a convex hull detection algorithm and a convex defect algorithm, and determining a target contour point in the contour point set based on a preset distance threshold, wherein the target contour point is a contour point in the contour point set, and the distance between the contour point set and the convex hull exceeds the preset distance threshold;
determining whether the modified cell nuclei are correct based on the number of the target contour points, and deleting incorrect modified cell nuclei.
4. The method for determining the karyotype abnormal cell according to claim 1, wherein the screening of the initial karyotype associated feature set based on the ReliefF algorithm and the maximum-correlation-minimum-redundancy mRMR algorithm to obtain the candidate karyotype associated feature set specifically includes:
respectively performing feature screening on each feature subset in the initial karyotype associated feature set based on a Relieff algorithm and an mrMR algorithm to obtain at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature;
and combining the candidate morphological characteristics, the candidate texture characteristics, the candidate color characteristics and the candidate edge characteristics to obtain the candidate karyotype association characteristic set.
5. The method for determining the karyotypically abnormal cell according to claim 4, wherein the feature screening is performed on each feature subset in the initial karyotypically correlated feature set based on a ReliefF algorithm and an mRMR algorithm, specifically including:
determining a target feature subset, and acquiring a feature value corresponding to each sample in a training sample set and the target feature subset;
determining the weight corresponding to each feature in the target feature subset through a Relieff algorithm based on the feature value corresponding to each sample in the training sample set and the target feature subset;
screening the features in the target feature subset based on the weight threshold corresponding to the target feature subset, and determining a first feature combination corresponding to the target feature subset;
and determining first average mutual information between each feature and the class C in the first feature combination and second average mutual information between each feature based on an mRMR algorithm, and screening the features in the first feature combination based on the first average mutual information and the second average mutual information to obtain a second feature combination corresponding to the target feature subset.
6. The method of determining karyotypically abnormal cells according to claim 5, wherein the feature in the second combination of features is a feature in the first combination of features that maximizes the difference or quotient between the first average mutual information and the second average mutual information.
7. The method for determining the karyotypically abnormal cell according to claim 1, wherein the secondary screening of the candidate karyotype-associated feature set based on the mRMR algorithm and the backward selection method specifically includes:
re-screening the features in the candidate karyotype associated feature set based on an mRMR algorithm to determine a potential karyotype associated feature set;
and selecting the features in the potential karyotype association feature set based on a backward selection method, and determining the target karyotype association feature.
8. The method according to claim 5, wherein the determining, by a Relieff algorithm, a weight corresponding to each feature in the target feature subset based on the feature value corresponding to each sample in the training sample set and the target feature subset comprises:
determining the difference between the feature values corresponding to the target sample and the similar nearest neighbor samples with the preset number and the difference between the feature values corresponding to the target sample and the different nearest neighbor samples with the preset number based on the feature values corresponding to the samples in the training sample set and the target feature subset;
and updating the weight corresponding to each feature in the target feature subset based on the difference between the feature values corresponding to the target sample and the similar nearest neighbor samples in the preset number and the difference between the feature values corresponding to the target sample and the different nearest neighbor samples in the preset number until the updating frequency reaches a preset threshold value.
9. The method of determining karyotypically abnormal cells according to claim 1, wherein the target karyotype-associated characteristic includes: the cell nucleus perimeter, the cell nucleus area, the cell nucleus circularity, the cell nucleus elongation, the cell nucleus aspect ratio, the cell nucleus rectangularity, the cell nucleus firmness, the cell nucleus direction degree, the cell nucleus long axis, the cell nucleus short axis, the cell nucleus microscopic image sharpness, the cell nucleus microscopic image total signal intensity, the cell nucleus microscopic image average signal intensity, the cell nucleus microscopic image signal intensity variance and the cell nucleus microscopic image corresponding Zernike moment.
10. An apparatus for determining karyotype abnormal cells, comprising:
the target karyotype association characteristic value determining module is used for determining a target karyotype association characteristic value corresponding to the cell to be identified based on the cell nucleus microscopic image corresponding to the cell to be identified; the target karyotype correlation characteristics are determined after screening a candidate karyotype correlation characteristic set from an initial karyotype correlation characteristic set based on a Relieff algorithm and a maximum correlation minimum redundancy (mRMR) algorithm, and performing secondary screening on the candidate karyotype correlation characteristic set based on an mRMR algorithm and a backward selection method; the initial karyotype associated feature set comprises a morphological feature subset, a texture feature subset, a color feature subset and an edge feature subset, and the candidate karyotype associated feature set comprises at least one candidate morphological feature, at least one candidate texture feature, at least one candidate color feature and at least one candidate edge feature;
the karyotype abnormal cell determination module is used for inputting the target karyotype associated characteristic value corresponding to the cell to be identified into the trained karyotype abnormal cell determination model and outputting a karyotype abnormal cell identification result corresponding to the target karyotype associated characteristic value;
the karyotype abnormal cell determination model is obtained by training a target karyotype association characteristic value sample and a predetermined karyotype abnormal cell identification result label.
11. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the steps of the method for determining karyotypically abnormal cells as claimed in any one of claims 1 to 9 when executing the program.
12. A non-transitory computer-readable storage medium, on which a computer program is stored, wherein the computer program, when being executed by a processor, implements the steps of the method for determining karyotypically abnormal cells according to any one of claims 1 to 9.
CN202210918890.9A 2022-08-02 2022-08-02 Method and device for determining karyotype abnormal cells Active CN114998332B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210918890.9A CN114998332B (en) 2022-08-02 2022-08-02 Method and device for determining karyotype abnormal cells

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210918890.9A CN114998332B (en) 2022-08-02 2022-08-02 Method and device for determining karyotype abnormal cells

Publications (2)

Publication Number Publication Date
CN114998332A true CN114998332A (en) 2022-09-02
CN114998332B CN114998332B (en) 2022-11-22

Family

ID=83021186

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210918890.9A Active CN114998332B (en) 2022-08-02 2022-08-02 Method and device for determining karyotype abnormal cells

Country Status (1)

Country Link
CN (1) CN114998332B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115564776A (en) * 2022-12-05 2023-01-03 珠海圣美生物诊断技术有限公司 Abnormal cell sample detection method and device based on machine learning
CN117218139A (en) * 2023-09-12 2023-12-12 珠海横琴圣澳云智科技有限公司 Method and device for determining cell density of sample
CN117593746A (en) * 2024-01-18 2024-02-23 武汉互创联合科技有限公司 Cell division balance evaluation system and device based on target detection

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130216118A1 (en) * 2010-11-05 2013-08-22 Peter Keith Rogan Centromere Detector and Method for Determining Radiation Exposure From Chromosome Abnormalities
CN106650314A (en) * 2016-11-25 2017-05-10 中南大学 Method and system for predicting amino acid mutation
CN108550148A (en) * 2018-04-13 2018-09-18 重庆大学 Nucleus in histotomy micro-image divides automatically and classifying identification method
CN111429468A (en) * 2020-03-18 2020-07-17 合肥凯碧尔高新技术有限公司 Cell nucleus segmentation method, device, equipment and storage medium
CN112288760A (en) * 2020-10-30 2021-01-29 东莞太力生物工程有限公司 Adherent cell image screening method and system and cell image analysis method
CN114283407A (en) * 2021-12-24 2022-04-05 江苏康尚生物医疗科技有限公司 Self-adaptive automatic leukocyte segmentation and subclass detection method and system
CN114495098A (en) * 2022-01-30 2022-05-13 南水北调中线干线工程建设管理局 Diaxing algae cell statistical method and system based on microscope image

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130216118A1 (en) * 2010-11-05 2013-08-22 Peter Keith Rogan Centromere Detector and Method for Determining Radiation Exposure From Chromosome Abnormalities
CN106650314A (en) * 2016-11-25 2017-05-10 中南大学 Method and system for predicting amino acid mutation
CN108550148A (en) * 2018-04-13 2018-09-18 重庆大学 Nucleus in histotomy micro-image divides automatically and classifying identification method
CN111429468A (en) * 2020-03-18 2020-07-17 合肥凯碧尔高新技术有限公司 Cell nucleus segmentation method, device, equipment and storage medium
CN112288760A (en) * 2020-10-30 2021-01-29 东莞太力生物工程有限公司 Adherent cell image screening method and system and cell image analysis method
CN114283407A (en) * 2021-12-24 2022-04-05 江苏康尚生物医疗科技有限公司 Self-adaptive automatic leukocyte segmentation and subclass detection method and system
CN114495098A (en) * 2022-01-30 2022-05-13 南水北调中线干线工程建设管理局 Diaxing algae cell statistical method and system based on microscope image

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
周琪: "特征选择与特征学习算法研究", 《中国优秀硕士学位论文全文数据库信息科技辑》 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115564776A (en) * 2022-12-05 2023-01-03 珠海圣美生物诊断技术有限公司 Abnormal cell sample detection method and device based on machine learning
CN115564776B (en) * 2022-12-05 2023-03-10 珠海圣美生物诊断技术有限公司 Abnormal cell sample detection method and device based on machine learning
CN117218139A (en) * 2023-09-12 2023-12-12 珠海横琴圣澳云智科技有限公司 Method and device for determining cell density of sample
CN117218139B (en) * 2023-09-12 2024-05-24 珠海横琴圣澳云智科技有限公司 Method and device for determining cell density of sample
CN117593746A (en) * 2024-01-18 2024-02-23 武汉互创联合科技有限公司 Cell division balance evaluation system and device based on target detection
CN117593746B (en) * 2024-01-18 2024-04-19 武汉互创联合科技有限公司 Cell division balance evaluation system and device based on target detection

Also Published As

Publication number Publication date
CN114998332B (en) 2022-11-22

Similar Documents

Publication Publication Date Title
CN114998332B (en) Method and device for determining karyotype abnormal cells
EP2302583B1 (en) Image segmentation
Liao et al. An accurate segmentation method for white blood cell images
US5933519A (en) Cytological slide scoring apparatus
CN112215800B (en) Machine learning-based overlapped chromosome recognition and segmentation method
CN115035518B (en) Method and device for identifying fluorescent staining signal points in cell nucleus image
CN110148126B (en) Blood leukocyte segmentation method based on color component combination and contour fitting
CN114463609B (en) Method and device for determining fluorescent mixed points in cell microscopic image
CN114240978B (en) Cell edge segmentation method and device based on adaptive morphology
CN113393454A (en) Method and device for segmenting pathological target examples in biopsy tissues
CN115170518A (en) Cell detection method and system based on deep learning and machine vision
CN111126162A (en) Method, device and storage medium for identifying inflammatory cells in image
CN114037868B (en) Image recognition model generation method and device
Ali et al. Optic Disc Localization in Retinal Fundus Images Based on You Only Look Once Network (YOLO).
CN110458846B (en) Cell image segmentation method based on graph path search and deep learning
CN115564776B (en) Abnormal cell sample detection method and device based on machine learning
CN116665210A (en) Cell classification method and device based on multichannel information fusion
CN115100648B (en) Method and device for filtering multi-channel fluorescent impurities in cell microscopic image
CN113591601B (en) Method and device for identifying hyphae in cornea confocal image
CN113469939B (en) HER-2 immunohistochemical automatic interpretation system based on characteristic curve
CN113537253A (en) Infrared image target detection method and device, computing equipment and storage medium
CN110021016B (en) Calcification detection method
Liu et al. Medical image segmentation and retrieval via deformable models
Bergemann et al. A statistically driven approach for image segmentation and signal extraction in cDNA microarrays
CN116486403B (en) Cell discrimination model construction method, device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant