CN114512177A - Method and device for tracing pedigree of crop breeding material source - Google Patents

Publication number
CN114512177A
Authority
CN
China
Prior art keywords
source
parent
breeding
subset
materials
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210407868.8A
Other languages
Chinese (zh)
Other versions
CN114512177B (en
Inventor
赵向宇
王开义
刘忠强
韩焱云
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Research Center of Information Technology of Beijing Academy of Agriculture and Forestry Sciences
Original Assignee
Research Center of Information Technology of Beijing Academy of Agriculture and Forestry Sciences
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Research Center of Information Technology of Beijing Academy of Agriculture and Forestry Sciences
Priority to CN202210407868.8A
Publication of CN114512177A
Application granted
Publication of CN114512177B
Legal status: Active
Anticipated expiration

Classifications

    • G: PHYSICS
    • G16: INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B: BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B10/00: ICT specially adapted for evolutionary bioinformatics, e.g. phylogenetic tree construction or analysis
    • G: PHYSICS
    • G16: INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B: BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00: ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • G16B25/10: Gene or protein expression profiling; Expression-ratio estimation or normalisation
    • G: PHYSICS
    • G16: INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B: BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00: ICT programming tools or database systems specially adapted for bioinformatics

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Theoretical Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Animal Behavior & Ethology (AREA)
  • Physiology (AREA)
  • Bioethics (AREA)
  • Databases & Information Systems (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention provides a method and a device for tracing the source pedigree of crop breeding materials. The method comprises: acquiring a breeding material data set, wherein the breeding material data set records the genetic relationships of all breeding materials; screening subsets according to whether the female parent material or the male parent material of each breeding material in the data set is known or unknown, and updating the source male parent and source female parent information of the breeding materials in each subset accordingly; and recursively searching the source parent information of the materials in each subset, starting from a target breeding material, to construct a source pedigree tree. The method enables automatic construction of the source pedigree of breeding materials and reduces the workload of breeders. It also facilitates visual presentation of the source pedigree, helping breeders analyze the genetic relationships of varieties more efficiently and intuitively, identify similar varieties, analyze the strengths and weaknesses of varieties, and optimize variety breeding schemes.

Description

Method and device for tracing pedigree of crop breeding material source
Technical Field
The invention relates to the field of crop breeding, and in particular to a method and a device for tracing the source pedigree of crop breeding materials.
Background
Breeding is a strategic and foundational industry. Breeding varieties with high yield, high quality, wide adaptability, and multiple resistances is the fundamental driving force of innovation and development in breeding and an important guarantee of food security.
The traits of a variety are usually an aggregation of the traits of its parent materials, and variety breeding is a process of progressive improvement. Most current mainstream varieties have undergone several rounds of hybridization improvement, and each hybridization is usually followed by six or more generations of selfing selection and purification, so the genetic relationships are complex. A pedigree is an important means of recording the lineage history and genetic relationships of breeding materials, and at present pedigrees are mainly recorded by hand. These complex relationships make manual pedigree recording very challenging. In addition, a pedigree contains a large amount of selfing selection and purification information, which increases the complexity of analyzing the blood-relationship structure of a variety. At present, source pedigree tracing of crop breeding materials is difficult to achieve by manual recording.
Disclosure of Invention
Aiming at the problems in the prior art, the invention provides a method and a device for tracing a pedigree of a crop breeding material source.
The invention provides a method for tracing the source pedigree of crop breeding materials, comprising: acquiring a breeding material data set, wherein the breeding material data set records the genetic relationships of all breeding materials; screening subsets according to whether the female parent material or the male parent material of each breeding material in the data set is known or unknown, and updating the source male parent and source female parent information of each subset accordingly; and recursively searching the source male parent and source female parent information of each material, starting from a target breeding material, to construct a source pedigree tree.
According to the method provided by the invention, acquiring the breeding material data set comprises: constructing a data set of triples from each breeding material identification and its corresponding female parent material identification and male parent material identification; wherein, for a selfing material, only the female parent material identification is recorded and the male parent material identification is empty; and, for a material of unknown source, both the female parent and male parent material identifications are empty.
According to the method provided by the invention, before the subsets are screened according to whether the female parent material or the male parent material of each breeding material is known or unknown, the method further comprises: screening, from the breeding material data set M, the materials that exist in the parent material set but not in the progeny material set to form a material set Q, and expanding M with Q in the same data form as M; wherein the breeding materials form the progeny material set O, and the male parent and female parent materials form the parent material set P.
According to the method provided by the invention, screening the subsets according to whether the female parent material or the male parent material of each breeding material is known or unknown, and updating the source male parent and source female parent information of each subset accordingly, comprises:
screening the breeding materials in the breeding material data set M whose female parent material identification and male parent material identification are both non-empty and adding them to material subset A, and setting the source female parent and source male parent of each material in subset A to that material's own female parent and male parent;
screening the breeding materials of unknown source in the breeding material data set, on the condition that the female parent material identification is empty, and adding them to material subset U;
screening the materials in M-A-U whose male parent material identification is empty and whose female parent is a material in U and adding them to material subset B, and setting the source female parent of each material in subset B to that material's own female parent;
screening the materials in material subset C whose male parent material identification is empty and whose female parent material has a non-empty source female parent and adding them to material subset C', and setting the source male parent and source female parent of each material in C' to the source male parent and source female parent of that material's female parent, wherein material subset C is obtained as M-A-U-B;
removing the materials of subset C' from subset C to update subset C;
and, if the updated subset C is an empty set, ending the process; otherwise, repeating the steps of screening the materials in subset C into subset C' and updating subset C.
According to the method provided by the invention, recursively searching the source male parent and source female parent information of each material, starting from the target breeding material, to construct the source pedigree tree comprises: taking the target material as the root node, initializing the source pedigree tree, and adding the root node to a set of nodes to be expanded; taking any node from the set of nodes to be expanded as the current node, querying the source female parent and source male parent information of the current node, and marking the corresponding information in the source pedigree tree; adding the source female parent and source male parent of the current node to the set of nodes to be expanded, skipping any source female parent or source male parent whose information is empty, and removing the current node from the set of nodes to be expanded; and, if the set of nodes to be expanded is empty, ending the process; otherwise, repeating from the step of taking any node from the set as the current node, until all nodes have been removed from the set.
The invention also provides a device for tracing the source pedigree of crop breeding materials, comprising: an information acquisition module, configured to acquire a breeding material data set that records the genetic relationships of all breeding materials; a source updating module, configured to screen subsets according to whether the female parent material or the male parent material of each breeding material in the data set is known or unknown, and to update the source male parent and source female parent information of each subset accordingly; and a pedigree tracing module, configured to recursively search the source male parent and source female parent information of each material, starting from a target breeding material, to construct a source pedigree tree.
According to the device for tracing the source pedigree of crop breeding materials provided by the invention, the source updating module is specifically configured for:
screening the breeding materials in the breeding material data set M whose female parent material identification and male parent material identification are both non-empty and adding them to material subset A, and setting the source female parent and source male parent of each material in subset A to that material's own female parent and male parent;
screening the breeding materials of unknown source in the breeding material data set, on the condition that the female parent material identification is empty, and adding them to material subset U;
screening the materials in M-A-U whose male parent material identification is empty and whose female parent is a material in U and adding them to material subset B, and setting the source female parent of each material in subset B to that material's own female parent;
screening the materials in material subset C whose male parent material identification is empty and whose female parent material has a non-empty source female parent and adding them to material subset C', and setting the source male parent and source female parent of each material in C' to the source male parent and source female parent of that material's female parent, wherein material subset C is obtained as M-A-U-B;
removing the materials of subset C' from subset C to update subset C;
and, if the updated subset C is an empty set, ending the process; otherwise, repeating the steps of screening the materials in subset C into subset C' and updating subset C.
The invention also provides an electronic device comprising a memory, a processor and a computer program stored in the memory and executable on the processor, wherein the processor executes the program to realize the method for tracing the source pedigree of the crop breeding material.
The present invention also provides a non-transitory computer readable storage medium having stored thereon a computer program which, when executed by a processor, implements a method for pedigree tracing of crop breeding material origin as described in any one of the above.
The invention also provides a computer program product comprising a computer program which, when executed by a processor, implements a method for pedigree tracing of crop breeding material as described in any one of the above.
According to the method and the device provided by the invention, the source male parent and source female parent information of the breeding materials in each subset is updated, the source parent information of the materials in each subset is recursively searched starting from the target breeding material, and the source pedigree tree is constructed. This enables automatic construction of the source pedigree of breeding materials and reduces the workload of breeders. It also facilitates visual presentation of the source pedigree, helping breeders analyze the genetic relationships of varieties more efficiently and intuitively, identify similar varieties, analyze the strengths and weaknesses of varieties, and optimize variety breeding schemes.
Drawings
In order to more clearly illustrate the technical solutions of the present invention or the prior art, the drawings needed for the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and those skilled in the art can also obtain other drawings according to the drawings without creative efforts.
FIG. 1 is a schematic flow chart of a pedigree tracing method for crop breeding materials provided by the invention;
FIG. 2 is a schematic diagram of the pedigree tracing results provided by the present invention;
FIG. 3 is a schematic structural diagram of a pedigree tracing device for crop breeding materials provided by the present invention;
FIG. 4 is a schematic structural diagram of an electronic device provided by the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is obvious that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be obtained by a person skilled in the art without inventive step based on the embodiments of the present invention, are within the scope of protection of the present invention.
To address the problems described above, the invention provides a method and a device for tracing the source pedigree of crop breeding materials. By automatically tracing and analyzing the blood-relationship structure of varieties, the method and the device can assist breeders in analyzing the genetic relationships of varieties, identifying similar varieties, analyzing the strengths and weaknesses of varieties, optimizing variety breeding schemes, and so on. The relevant terms are first explained as follows:
Genetic relationship analysis: analyzing the genetic relationships among families so as to reasonably avoid homogenization of breeding combinations, and providing a necessary basis for determining core backbone parents from their occurrence among families.
Genetic source tracing: tracing the blood relationship of the parents, analyzing the proportion in which parental traits are inherited and the main strengths and weaknesses of a combination; assisting in establishing secondary improvement targets; and providing a basis for the popularization and application of varieties.
The method and device for tracing the source pedigree of crop breeding materials according to the present invention are described below with reference to FIGS. 1 to 4. FIG. 1 is a schematic flow chart of the method provided by the present invention. As shown in FIG. 1, the method comprises:
S1, acquiring a breeding material data set, wherein the breeding material data set records the genetic relationships of all breeding materials.
The breeding material data set M records the parentage of all breeding materials to be traced, that is, which breeding materials are included and which materials are the female parent and the male parent of each; materials can be identified by name.
In one embodiment, acquiring the breeding material data set M comprises: constructing a data set of triples from each breeding material identification and its corresponding female parent material identification and male parent material identification; wherein, for a selfing material, only the female parent material identification is recorded and the male parent material identification is empty; and, for a material of unknown source, both the female parent and male parent material identifications are empty.
The breeding material data set M can be collected and constructed as triples of the form {material name, female parent name, male parent name}. The breeding material data set M used in this example is shown in Table 1.
TABLE 1
[Table 1 is provided as an image in the original publication and is not reproduced here.]
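As a minimal sketch of the triple form described above, the Python fragment below models one record as a named tuple and classifies it as hybrid, selfing, or unknown source according to which parent fields are empty. The `Material` type, the `kind` helper, and the sample names are illustrative assumptions; they are not the contents of Table 1.

```python
from typing import NamedTuple, Optional


class Material(NamedTuple):
    name: str
    female: Optional[str]  # female parent name; None when the source is unknown
    male: Optional[str]    # male parent name; None for selfing or unknown source


# Hypothetical records for illustration only (not the contents of Table 1).
M = [
    Material("H1", "P1", "P2"),  # hybrid: both parents recorded
    Material("S1", "H1", None),  # selfing: only the female parent recorded
    Material("P1", None, None),  # unknown source: both parents empty
]


def kind(m: Material) -> str:
    """Classify a record by which parent fields are empty."""
    if m.female is None:
        return "unknown"
    return "selfing" if m.male is None else "hybrid"


kinds = [kind(m) for m in M]
```

Using `None` for an empty field keeps the later screening conditions ("female parent name is empty") as simple identity checks.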
S2, screening subsets according to whether the female parent material or the male parent material of each breeding material in the breeding material data set is known or unknown, and updating the source male parent and source female parent information of each subset accordingly.
Specifically, each element of M may be represented as a five-element record {material name, female parent name, male parent name, source female parent, source male parent}; the data set in this form is denoted M'. Before the subsets are screened, the source female parent and source male parent of every material are empty.
The screening and the updating of the source male parent and source female parent information can be performed for the following cases, respectively:
Hybrid materials in the material data set M are screened into material subset A on the condition that neither the female parent name nor the male parent name is empty.
Materials of unknown source in the material data set M are screened into material subset U on the condition that the female parent name is empty. On this basis, a further material subset B is screened on the condition that the male parent name is empty and the female parent is a material in U.
A material subset C' is then screened on the condition that the male parent name is empty and the source female parent of the female parent material is not empty.
S3, recursively searching the source parent information of each material, starting from the target material, to construct a source pedigree tree.
The target material is the breeding material whose source pedigree is to be traced; in this embodiment it is located using the name YS17 as the query condition. The target material is taken as the root node to initialize the source pedigree tree, the source male parent and source female parent information in the screened subsets is searched recursively, and the pedigree tree is finally constructed.
According to the method provided by the invention, the source male parent and source female parent information of the breeding materials in each subset is updated, the source parent information of the materials in each subset is recursively searched starting from the target breeding material, and the source pedigree tree is constructed. This enables automatic construction of the source pedigree of breeding materials and reduces the workload of breeders. It also facilitates visual presentation of the source pedigree, helping breeders analyze the genetic relationships of varieties more efficiently and intuitively, identify similar varieties, analyze the strengths and weaknesses of varieties, and optimize variety breeding schemes.
In one embodiment, before the subsets are screened according to whether the female parent material or the male parent material of each breeding material in M is known or unknown, the method further comprises:
screening, from M, the materials that exist in the parent material set but not in the progeny material set to form a material set Q, and expanding M with Q in the same data form as M; wherein the breeding materials form the progeny material set O, and the male parent and female parent materials form the parent material set P.
The specific steps are as follows:
First, the breeding material data set M is traversed; the material name of each material is added to the progeny material set O, and its female parent name and male parent name are added to the parent material set P.
The progeny material set O is {YS03, YS04, YS05, YS06, YS07, YS08, YS09, YS10, YS11, YS12, YS13, YS14, YS15, YS16, YS17, YS18, YS19, YS20}, and the parent material set P is {YS01, YS02, YS03, YS04, YS05, YS06, YS07, YS08, YS09, YS10, YS11, YS12, YS13, YS14, YS15, YS16, YS17, YS19}.
Next, the materials that are present in the parent material set P but not in the progeny material set O are screened and added to the material set Q, i.e.:
Q = P - O;
The material set Q is {YS01, YS02}.
Finally, each material in Q is expanded into a triple whose female parent name and male parent name are empty, and the expanded triples are added to the breeding material data set M. The expanded breeding material data set M is shown in Table 2.
TABLE 2
[Table 2 is provided as an image in the original publication and is not reproduced here.]
The expanded data set M is used for the screening and updating in step S2 above.
Through this expansion, the breeding materials covered by the breeding material data set M are more complete.
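The expansion step can be sketched as follows, assuming M is held as a list of (material name, female parent name, male parent name) triples with `None` for an empty field. The function name `expand` and the two-record sample are hypothetical; the final check uses the sets O and P listed above and reproduces Q = {YS01, YS02}.

```python
def expand(M):
    """Return (expanded M, Q) for a list of (name, female, male) triples."""
    O = {name for name, _, _ in M}                  # progeny material set
    P = {p for _, female, male in M
         for p in (female, male) if p is not None}  # parent material set
    Q = P - O                                       # parents never listed as progeny
    # Each material in Q is added with empty female and male parent names.
    return M + [(q, None, None) for q in sorted(Q)], Q


# Hypothetical sample for illustration only (not the contents of Table 1).
sample = [("H1", "P1", "P2"), ("S1", "H1", None)]
expanded, Q = expand(sample)

# Check against the sets O and P given in the text.
O_text = {f"YS{i:02d}" for i in range(3, 21)}             # YS03 .. YS20
P_text = {f"YS{i:02d}" for i in range(1, 18)} | {"YS19"}  # YS01 .. YS17, YS19
Q_text = P_text - O_text
```

After expansion every parent name also appears as a material name, so later lookups of a material's female parent always find a record.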
In one embodiment, screening the subsets according to whether the female parent material or the male parent material of each breeding material in the breeding material data set is known or unknown, and updating the source male parent and source female parent information of each subset accordingly, comprises:
screening the breeding materials in the breeding material data set M whose female parent material identification and male parent material identification are both non-empty and adding them to material subset A, and setting the source female parent and source male parent of each material in subset A to that material's own female parent and male parent;
screening the breeding materials of unknown source in the breeding material data set, on the condition that the female parent material identification is empty, and adding them to material subset U;
screening the materials in M-A-U whose male parent material identification is empty and whose female parent is a material in U and adding them to material subset B, and setting the source female parent of each material in subset B to that material's own female parent;
screening the materials in material subset C whose male parent material identification is empty and whose female parent material has a non-empty source female parent and adding them to material subset C', and setting the source male parent and source female parent of each material in C' to the source male parent and source female parent of that material's female parent, wherein material subset C is obtained as M-A-U-B;
removing the materials of subset C' from subset C to update subset C;
and, if the updated subset C is an empty set, ending the process; otherwise, repeating the steps of screening the materials in subset C into subset C' and updating subset C.
Specifically, S2 in this embodiment includes the following steps:
S22, screening the hybrid materials in the material data set M' into material subset A on the condition that neither the female parent name nor the male parent name is empty, and setting the source parent information of each material in subset A to that material's own parents, namely:
A_i(source female parent) = A_i(female parent name);
A_i(source male parent) = A_i(male parent name);
where A_i is any material in material subset A.
The updated material subset A is shown in Table 3.
TABLE 3
[Table 3 is provided as an image in the original publication and is not reproduced here.]
S23, screening the materials of unknown source in the material data set M' into material subset U on the condition that the female parent name is empty.
Material subset U is shown in Table 4.
TABLE 4
[Table 4 is provided as an image in the original publication and is not reproduced here.]
S24, screening the materials in M'-A-U into material subset B on the condition that the male parent name is empty and the female parent name is a material in material subset U, and setting the source female parent of each material in subset B to that material's own female parent, namely:
B_i(source female parent) = B_i(female parent name);
where B_i is any material in material subset B.
Material subset B is shown in Table 5.
TABLE 5
[Table 5 is provided as an image in the original publication and is not reproduced here.]
S25, denoting the material subset M'-A-U-B as material subset C. That is, C contains the remaining materials of M' that have a female parent but no male parent.
Material subset C is shown in Table 6.
TABLE 6
[Table 6 is provided as an image in the original publication and is not reproduced here.]
S26, screening the materials in material subset C into material subset C' on the condition that the male parent name is empty and the source female parent of the female parent material is non-empty, and setting the source information of each material in subset C' to the source information of that material's female parent, namely:
C'_i(source female parent) = M_j(source female parent);
C'_i(source male parent) = M_j(source male parent);
where C'_i is any material in material subset C' and M_j is the female parent of C'_i, namely:
C'_i(female parent name) = M_j(material name).
The updated material subset C' is shown in Table 7.
TABLE 7
[Table 7 is provided as an image in the original publication and is not reproduced here.]
S27, removing the materials of material subset C' from material subset C.
Material subset C after the removal of C' is shown in Table 8.
TABLE 8
[Table 8 is provided as an image in the original publication and is not reproduced here.]
S28, determining whether material subset C is an empty set; if so, ending step S2; if not, repeating S26 to S28.
In this embodiment, C is not an empty set when S28 is first reached, and S26 to S28 are repeated for three rounds, which completes the update of the material source information. The updated material data set M' is shown in Table 9.
TABLE 9
[Table 9 is provided as an image in the original publication and is not reproduced here.]
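Steps S22 to S28 can be sketched in Python as below. This is an illustration under stated assumptions, not the patented implementation: the input is assumed to be the already expanded data set as (name, female, male) triples with `None` for empty fields, the function name `update_sources` and the field names `src_female` and `src_male` are hypothetical, the sample materials are invented, and the `ValueError` guard against inconsistent (for example cyclic) data is an addition that the text does not describe.

```python
def update_sources(triples):
    """Fill in source female/male parents following steps S22 to S28."""
    rec = {n: {"female": f, "male": m, "src_female": None, "src_male": None}
           for n, f, m in triples}

    # S22: hybrids take their own parents as source parents.
    A = {n for n, r in rec.items()
         if r["female"] is not None and r["male"] is not None}
    for n in A:
        rec[n]["src_female"] = rec[n]["female"]
        rec[n]["src_male"] = rec[n]["male"]

    # S23: unknown-source materials (female parent empty).
    U = {n for n, r in rec.items() if r["female"] is None}

    # S24: selfing materials whose female parent is an unknown-source material.
    B = {n for n in set(rec) - A - U
         if rec[n]["male"] is None and rec[n]["female"] in U}
    for n in B:
        rec[n]["src_female"] = rec[n]["female"]

    # S25 to S28: resolve the remaining selfing materials round by round.
    C = set(rec) - A - U - B
    while C:
        C_prime = {n for n in C
                   if rec[n]["male"] is None
                   and rec[rec[n]["female"]]["src_female"] is not None}
        if not C_prime:  # guard added for this sketch, not part of the text
            raise ValueError("unresolvable materials: %s" % sorted(C))
        for n in C_prime:
            mother = rec[rec[n]["female"]]
            rec[n]["src_female"] = mother["src_female"]
            rec[n]["src_male"] = mother["src_male"]
        C -= C_prime
    return rec


# Hypothetical data for illustration only (not the contents of Table 2).
data = [("F1", None, None), ("F2", None, None), ("H1", "F1", "F2"),
        ("S1", "H1", None), ("S2", "S1", None),
        ("L1", "F1", None), ("L2", "L1", None)]
result = update_sources(data)
```

In this sample the selfing chain H1, S1, S2 collapses so that S2 points directly at F1 and F2, which is the effect of copying the female parent's source information in S26. Because C' is computed before any update in a round, S2 is resolved one round after S1.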
In one embodiment, recursively searching the source male parent and source female parent information of each material, starting from the target breeding material, to construct the source pedigree tree comprises: taking the target material as the root node, initializing the source pedigree tree, and adding the root node to a set of nodes to be expanded; taking any node from the set of nodes to be expanded as the current node, querying the source female parent and source male parent information of the current node, and marking the corresponding information in the source pedigree tree; adding the source female parent and source male parent of the current node to the set of nodes to be expanded, skipping any source female parent or source male parent whose information is empty, and removing the current node from the set of nodes to be expanded; and, if the set of nodes to be expanded is empty, ending the process; otherwise, repeating from the step of taking any node from the set as the current node, until all nodes have been removed from the set.
Specifically, S3 in this embodiment of the present invention includes the following steps:
S30, locating the target material according to the query condition.
This embodiment locates the target material using the material name YS17 as the query condition.
S31, initializing the source pedigree tree with the target material as the root node.
YS17 is the root node of the source pedigree tree.
S32, adding the root node to the set of nodes to be expanded.
The set of nodes to be expanded is {YS17}.
S33, taking any node from the set of nodes to be expanded as the current node, querying the source female parent and source male parent information of that node, and marking it in the source pedigree tree.
YS17 is taken as the current node, and YS10 and YS14 are added to the source pedigree tree as its source female parent and source male parent, respectively.
S34, adding the source female parent and source male parent of the current node to the set of nodes to be expanded; if the source female parent or source male parent information is empty, that parent is skipped.
The set of nodes to be expanded is {YS17, YS10, YS14}.
S35, removing the current node from the set of nodes to be expanded.
The set of nodes to be expanded is {YS10, YS14}.
S36, judging whether the set of nodes to be expanded is an empty set; if so, step S3 ends; if not, S33-S36 are repeated.
Since the set of nodes to be expanded is not empty, YS10 is selected as the current node and S33-S36 are repeated until the source pedigree tracing is complete. The final source pedigree tracing result is shown in Fig. 2.
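Steps S31-S36 amount to a work-set traversal, which the following Python sketch illustrates under stated assumptions. The function name `build_source_tree` and the mapping layout are hypothetical. Only the source parents of YS17 (YS10 and YS14) come from the embodiment; the entries for YS10, YS14, G1 and G2 are placeholders invented for the example and are not the values of Table 9. The check that skips already recorded nodes is an added assumption to avoid expanding a shared ancestor twice.

```python
from typing import Dict, Optional, Tuple

# material -> (source female parent, source male parent); None means empty.
Sources = Dict[str, Tuple[Optional[str], Optional[str]]]


def build_source_tree(sources: Sources, target: str) -> Sources:
    """Return each reached node mapped to its source parents."""
    tree: Sources = {}
    to_expand = {target}  # S31-S32: root node enters the work set
    while to_expand:      # S36: finish when the work set is empty
        current = to_expand.pop()  # S33 and S35: take any node and remove it
        female, male = sources.get(current, (None, None))
        tree[current] = (female, male)  # S33: mark parents in the tree
        for parent in (female, male):   # S34: empty parents are skipped
            # Assumption: do not re-expand a node that is already recorded.
            if parent is not None and parent not in tree:
                to_expand.add(parent)
    return tree


sources = {
    "YS17": ("YS10", "YS14"),  # from the embodiment
    "YS10": ("G1", "G2"),      # hypothetical placeholder
    "YS14": (None, None),      # hypothetical placeholder
    "G1": (None, None),        # hypothetical placeholder
    "G2": (None, None),        # hypothetical placeholder
}
tree = build_source_tree(sources, "YS17")
```

Because the work set is unordered, the visiting order may differ between runs, which matches the wording "taking any node" in S33; the resulting parent links do not depend on that order.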
The device for tracing the source pedigree of crop breeding materials provided by the present invention is described below; the device described below and the method described above may be referred to in correspondence with each other.
Fig. 3 is a schematic structural diagram of the device for tracing the source pedigree of crop breeding materials according to the present invention. As shown in Fig. 3, the device includes an information acquisition module 301, a source update module 302, and a pedigree tracing module 303. The information acquisition module 301 is configured to acquire a breeding material data set, where the breeding material data set records the genetic relationships of all breeding materials; the source update module 302 is configured to screen subsets according to whether the female parent material or the male parent material of each breeding material in the breeding material data set is known or unknown, and to update the source male parent and source female parent information of the breeding materials in each subset accordingly; the pedigree tracing module 303 is configured to recursively search the source female parent and source male parent information of each material, starting from the target breeding material, and to construct a source pedigree tree.
In an apparatus embodiment, the information acquisition module 301 is specifically configured to: construct a data set in triple form from the breeding material identification and the corresponding female parent material identification and male parent material identification; where, for a selfed material, only the female parent material identification is recorded and the male parent material identification is empty; and for a material of unknown source, both the female parent material identification and the male parent material identification are null.
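The triple form described above can be pictured with a small Python sketch. The class name `MaterialRecord`, the function `kind`, the category labels, and the material names are assumptions made for illustration. Treating any record with an empty female parent as "unknown source" follows the screening condition used for subset U.

```python
from typing import List, NamedTuple, Optional


class MaterialRecord(NamedTuple):
    """One triple: material, female parent, male parent (None means empty)."""
    material: str
    female: Optional[str]
    male: Optional[str]


def kind(r: MaterialRecord) -> str:
    if r.female is None:
        return "unknown source"  # no parent information recorded
    if r.male is None:
        return "selfed"          # only the female parent is recorded
    return "cross"               # both parents are recorded


# Hypothetical data set M in triple form.
data: List[MaterialRecord] = [
    MaterialRecord("P1", None, None),
    MaterialRecord("H1", "P1", "P2"),
    MaterialRecord("L1", "H1", None),
]
kinds = [kind(r) for r in data]
```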
In an apparatus embodiment, the apparatus is further configured to perform the following operation before the subsets are screened according to whether the female parent material or the male parent material of each breeding material in the breeding material data set is known or unknown: according to the breeding material data set M, screening the materials that exist in the parent material set P but not in the progeny material set O to form a material set Q, and expanding M with Q in the same data form as M; where all breeding materials form the progeny material set O, and the male parent and female parent materials form the parent material set P.
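The expansion of M with Q can be sketched in Python as a set difference. The function name `expand_with_missing_parents` and the sample materials are hypothetical. Recording each material of Q with both parents empty, and sorting Q for a reproducible order, are assumptions consistent with the triple form but not stated in this passage.

```python
from typing import List, Optional, Tuple

Triple = Tuple[str, Optional[str], Optional[str]]  # material, female, male


def expand_with_missing_parents(m: List[Triple]) -> List[Triple]:
    """Append parents that never appear as progeny, with empty parent fields."""
    offspring = {name for name, _, _ in m}                   # set O
    parents = {p for _, f, ml in m for p in (f, ml) if p}    # set P
    q = sorted(parents - offspring)                          # set Q = P - O
    return m + [(name, None, None) for name in q]


# Hypothetical data set: P1 and P2 appear only as parents.
m = [("H1", "P1", "P2"), ("L1", "H1", None)]
expanded = expand_with_missing_parents(m)
```

After this step every parent name resolves to a record in M, so later lookups of a female parent do not need a special case for missing materials.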
In an apparatus embodiment, the source update module 302 is specifically configured to: screen the breeding materials in the breeding material data set M under the condition that neither the female parent material identification nor the male parent material identification is null, add them to the material subset A, and update the source female parent and source male parent information of all materials in the material subset A to the female parent and male parent of the respective materials; screen the breeding materials of unknown source in the breeding material data set under the condition that the female parent material identification is null, and add them to the material subset U; screen the materials in the material subset M-A-U under the condition that the male parent material identification is empty and the female parent material is a material in U, add them to the material subset B, and update the source female parent information of all materials in the material subset B to the female parent of the respective materials; screen the materials in the material subset C under the condition that the male parent material identification is empty and the source female parent of the female parent material is not empty, add them to the material subset C', and update the source female parent and source male parent information of all materials in the material subset C' to the source female parent and source male parent of the female parent of the respective materials, where the material subset C is obtained as M-A-U-B; remove the materials of the material subset C' from the material subset C to update the material subset C; and, if the updated material subset C is an empty set, end the process, otherwise repeat the screening of the material subset C into the material subset C' and the updating of the material subset C.
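The initial split of M into the subsets A, U, B and C can be sketched as follows. This is an illustrative Python sketch only: the function name `initial_partition`, the mapping layout, and the sample materials are assumptions, and the iterative processing of C is deliberately left out.

```python
from typing import Dict, Optional, Set, Tuple

Parents = Tuple[Optional[str], Optional[str]]  # (female parent, male parent)


def initial_partition(m: Dict[str, Parents]):
    """Split M into A, U, B, C and set the sources known at this stage."""
    # A: both parents recorded (cross).
    a: Set[str] = {n for n, (f, ml) in m.items() if f is not None and ml is not None}
    # U: female parent empty (unknown source).
    u: Set[str] = {n for n, (f, _) in m.items() if f is None}
    # B: within M-A-U, male parent empty and female parent belongs to U.
    b: Set[str] = {n for n in m.keys() - a - u
                   if m[n][1] is None and m[n][0] in u}
    # C: everything that remains, resolved later by iteration.
    c: Set[str] = set(m.keys() - a - u - b)

    sources: Dict[str, Parents] = {n: (None, None) for n in m}
    for n in a:
        sources[n] = m[n]             # source parents are the parents themselves
    for n in b:
        sources[n] = (m[n][0], None)  # only the source female parent is set
    return a, u, b, c, sources


# Hypothetical data set.
m = {
    "P1": (None, None), "P2": (None, None),
    "H1": ("P1", "P2"),   # cross
    "S1": ("P1", None),   # selfed from an unknown-source material
    "L1": ("H1", None),   # selfed from a cross
}
a, u, b, c, sources = initial_partition(m)
```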
In an apparatus embodiment, the pedigree tracing module 303 is specifically configured to: initialize the source pedigree tree with the target material as the root node, and add the root node to the set of nodes to be expanded; take any node from the set of nodes to be expanded as the current node, query the source female parent and source male parent information of the current node, and mark the corresponding information in the source pedigree tree; add the source female parent and source male parent of the current node to the set of nodes to be expanded, skipping any source female parent or source male parent whose information is empty, and remove the current node from the set of nodes to be expanded; and, if the set of nodes to be expanded is an empty set, end the process, otherwise repeat the step of taking any node from the set of nodes to be expanded as the current node until all nodes have been removed from the set of nodes to be expanded.
The apparatus embodiments provided by the present invention implement the method embodiments described above; for the specific processes and details, reference is made to the method embodiments, which are not repeated here.
The implementation principle and technical effects of the device for tracing the source pedigree of crop breeding materials provided by the embodiments of the present invention are the same as those of the method embodiments. For brevity, where the apparatus embodiments do not mention a point, reference may be made to the corresponding content of the method embodiments.
Fig. 4 is a schematic structural diagram of an electronic device provided by the present invention. As shown in Fig. 4, the electronic device may include a processor 401, a communication interface 402, a memory 403 and a communication bus 404, where the processor 401, the communication interface 402 and the memory 403 communicate with each other through the communication bus 404. The processor 401 may invoke logic instructions in the memory 403 to perform a method for tracing the source pedigree of crop breeding materials, the method comprising: acquiring a breeding material data set, where the breeding material data set records the genetic relationships of all breeding materials; screening subsets according to whether the female parent material or the male parent material of each breeding material in the breeding material data set is known or unknown, and updating the source male parent and source female parent information of the breeding materials in each subset accordingly; and recursively searching the source female parent and source male parent information of each material, starting from the target breeding material, to construct a source pedigree tree.
In addition, the logic instructions in the memory 403 may be implemented in the form of software functional units and stored in a computer readable storage medium when the software functional units are sold or used as independent products. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
In another aspect, the present invention also provides a computer program product comprising a computer program stored on a non-transitory computer readable storage medium; when executed by a processor, the computer program is capable of performing the method for tracing the source pedigree of crop breeding materials provided above, the method comprising: acquiring a breeding material data set, where the breeding material data set records the genetic relationships of all breeding materials; screening subsets according to whether the female parent material or the male parent material of each breeding material in the breeding material data set is known or unknown, and updating the source male parent and source female parent information of the breeding materials in each subset accordingly; and recursively searching the source female parent and source male parent information of each material, starting from the target breeding material, to construct a source pedigree tree.
In yet another aspect, the present invention also provides a non-transitory computer readable storage medium on which a computer program is stored; when executed by a processor, the computer program implements the method for tracing the source pedigree of crop breeding materials provided above, the method comprising: acquiring a breeding material data set, where the breeding material data set records the genetic relationships of all breeding materials; screening subsets according to whether the female parent material or the male parent material of each breeding material in the breeding material data set is known or unknown, and updating the source male parent and source female parent information of the breeding materials in each subset accordingly; and recursively searching the source female parent and source male parent information of each material, starting from the target breeding material, to construct a source pedigree tree.
The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.
Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in the embodiments or some parts of the embodiments.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (9)

1. A method for tracing a pedigree of a crop breeding material source is characterized by comprising the following steps:
acquiring a breeding material data set, wherein the breeding material data set records the genetic relationship of all breeding materials;
respectively screening subsets according to the known or unknown conditions of the female parent material or the male parent material of each breeding material in the breeding material data set, and correspondingly updating the source male parent and source female parent information of each subset;
and recursively searching the source female parent and source male parent information of each material, starting from the target breeding material, to construct a source pedigree tree.
2. The method of claim 1, wherein obtaining a breeding material data set comprises:
constructing a ternary data form data set according to the breeding material identification, the female parent material identification and the male parent material identification corresponding to the breeding material;
wherein, the selfing material only records the female parent material mark, and the male parent material mark is empty; the data positions corresponding to the female parent material and the male parent material identifications of the unknown source materials are null.
3. The method for pedigree tracing of origin of crop breeding material as claimed in claim 1, wherein before said screening subsets according to known or unknown conditions of female parent material or male parent material of each breeding material in a breeding material data set, further comprising:
screening materials which exist in a parent material set but do not exist in a progeny material set according to a breeding material data set M to form a material set Q, and expanding M by using Q in the same data form of M; wherein, each breeding material forms a progeny material set O, and the male parent and female parent materials form a parent material set P.
4. The method for tracing pedigree of origin of crop breeding material according to claim 1, wherein the step of screening subsets according to known or unknown conditions of female parent material or male parent material of each breeding material in the breeding material data set, and updating the information of male parent and female parent origin of each breeding material in each subset accordingly comprises:
screening the breeding materials in the breeding material data set M and adding the breeding materials into the material subset A under the condition that the female parent material identification and the male parent material identification are not null, and updating the source parent-parent information of all the materials in the material subset A as the male parent and the female parent of the respective materials;
screening the breeding materials of unknown source in the breeding material data set under the condition that the female parent material identification is null, and adding them to the material subset U;
screening materials in the material subset M-A-U and adding the materials into the material subset B under the conditions that the male parent material identification is empty and the female parent material identification is materials in the material subset U, and updating all material source female parent information in the material subset B to be female parents of all the materials;
selecting the materials in the material subset C to be added into the material subset C 'under the conditions that the male parent material mark is empty and the source female parent of the female parent material is not empty, and updating the source parent-parent information of all the materials in the material subset C' as the source male parent and the source female parent of each material female parent; wherein the material subset C is obtained according to M-A-U-B;
removing the materials of the material subset C' from the material subset C to update the material subset C;
and if the updated material subset C is an empty set, ending the process, otherwise, repeating the process of screening the materials in the material subset C, adding the materials into the material subset C' and updating the material subset C under the condition that the male parent material mark is empty and the female parent material source is not empty.
5. The method for tracing the pedigree of a crop breeding material source according to claim 1, wherein recursively searching the source female parent and source male parent information of each material, starting from the target breeding material, to construct a source pedigree tree comprises:
taking a target material as a root node, initializing a source family tree, and adding the root node into a node set to be expanded;
taking any node from the node set to be expanded as a current node, inquiring source female parent and source male parent information of the current node, and marking corresponding information in a source genealogy tree;
adding a source female parent and a source male parent of a current node into a node set to be expanded, and removing the current node from the node set to be expanded; skipping if the source female parent or the source male parent information is empty;
and if the set of nodes to be expanded is an empty set, ending the process, otherwise, repeating the process of taking any node from the set of nodes to be expanded as the current node until all nodes are removed from the set of nodes to be expanded.
6. A device for tracing a pedigree of a crop breeding material source is characterized by comprising:
the information acquisition module is used for acquiring a breeding material data set, and the breeding material data set records the genetic relationship of all breeding materials;
the source updating module is used for respectively screening the subsets according to the known or unknown conditions of the female parent material or the male parent material of each breeding material in the breeding material data set, and correspondingly updating the source male parent and source female parent information of each subset;
and the pedigree tracing module is used for recursively searching the source female parent and source male parent information of each material, starting from the target breeding material, to construct a source pedigree tree.
7. The crop breeding material source pedigree tracing apparatus of claim 6, wherein said source update module is specifically configured to:
screening the breeding materials in the breeding material data set M and adding the breeding materials into the material subset A under the condition that the female parent material identification and the male parent material identification are not null, and updating the source parent-parent information of all the materials in the material subset A as the male parent and the female parent of the respective materials;
screening the breeding materials of unknown source in the breeding material data set under the condition that the female parent material identification is null, and adding them to the material subset U;
screening materials in the material subset M-A-U and adding the materials into the material subset B under the condition that the male parent material mark is empty and the female parent material mark is the material in the U, and updating all material source female parent information in the material subset B as the female parent of each material;
selecting the materials in the material subset C to be added into the material subset C 'under the conditions that the male parent material mark is empty and the source female parent of the female parent material is not empty, and updating the source parent-parent information of all the materials in the material subset C' as the source male parent and the source female parent of each material female parent; wherein the material subset C is obtained according to M-A-U-B;
removing the materials of the material subset C' from the material subset C to update the material subset C;
and if the updated material subset C is an empty set, ending the process, otherwise, repeating the process of screening the materials in the material subset C, adding the materials into the material subset C' and updating the material subset C under the condition that the male parent material mark is empty and the female parent material source is not empty.
8. An electronic device comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, wherein the processor when executing the program implements the crop breeding material pedigree tracing method of any one of claims 1 to 5.
9. A non-transitory computer readable storage medium having stored thereon a computer program, wherein the computer program when executed by a processor implements the crop breeding material pedigree tracing method according to any one of claims 1 to 5.
CN202210407868.8A 2022-04-19 2022-04-19 Method and device for tracing pedigree of crop breeding material source Active CN114512177B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210407868.8A CN114512177B (en) 2022-04-19 2022-04-19 Method and device for tracing pedigree of crop breeding material source

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210407868.8A CN114512177B (en) 2022-04-19 2022-04-19 Method and device for tracing pedigree of crop breeding material source

Publications (2)

Publication Number Publication Date
CN114512177A true CN114512177A (en) 2022-05-17
CN114512177B CN114512177B (en) 2022-07-22

Family

ID=81554873

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210407868.8A Active CN114512177B (en) 2022-04-19 2022-04-19 Method and device for tracing pedigree of crop breeding material source

Country Status (1)

Country Link
CN (1) CN114512177B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009183224A (en) * 2008-02-07 2009-08-20 Mie Prefecture Method for breeding first filial hybrid of seed-propagation type blight-resistant strawberry variety
CN107562964A (en) * 2017-10-17 2018-01-09 山东省农业科学院玉米研究所 A kind of crops pedigree construction method, database, retroactive method and system
CN113611369A (en) * 2021-08-13 2021-11-05 梅州市农林科学院粮油研究所(梅州市农林科学院客都米研究所) Rice breeding information data management system

Also Published As

Publication number Publication date
CN114512177B (en) 2022-07-22

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant