WO2022185531A1 - 情報処理装置、情報処理方法、検知モデルの製造方法、およびプログラム - Google Patents
情報処理装置、情報処理方法、検知モデルの製造方法、およびプログラム Download PDFInfo
- Publication number
- WO2022185531A1 WO2022185531A1 PCT/JP2021/008696 JP2021008696W WO2022185531A1 WO 2022185531 A1 WO2022185531 A1 WO 2022185531A1 JP 2021008696 W JP2021008696 W JP 2021008696W WO 2022185531 A1 WO2022185531 A1 WO 2022185531A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data set
- pseudo
- inference
- detection model
- label
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
Definitions
- the present invention relates to a technique for associating pseudo-labels with one or more images included in a dataset used for re-learning a detection model.
- a detection model that detects objects contained in images becomes a highly accurate detection model by learning using a large number of correct data.
- the process of collecting a large amount of data and associating correct labels with the data is expensive. For this reason, in order to generate a highly accurate detection model from a small number of correct data, there is known a technique of associating false labels with non-correct data.
- Non-Patent Document 1 Since the method described in Non-Patent Document 1 requires adjustment to set an appropriate threshold, there is room for reducing the time and computational costs required for this adjustment. In other words, there is room for further reducing the cost of generating highly accurate detection models using pseudo labels.
- an object of one aspect of the present invention is to provide a technology capable of generating a highly accurate detection model while suppressing generation costs.
- An information processing apparatus includes learning means for learning a detection model using a first data set, and inputting each of one or more images included in an evaluation data set to the detection model.
- a threshold determination means for determining a threshold by referring to a result of comparison between one or more inference results obtained by and one or more correct labels attached to each of the one or more images;
- inference means for obtaining one or more inference results for each of the one or more images by inputting each of the one or more images included in the data set into the detection model; Or, among a plurality of inference results, an inference result having a reliability equal to or higher than the threshold is set as a pseudo-label, and the pseudo-label is associated with the corresponding image, thereby generating a dataset after pseudo-labeling.
- An information processing apparatus includes first learning means for learning a first detection model using a first data set, and training of a second detection model using a second data set.
- second learning means for performing learning; one or more inference results obtained by inputting each of one or more images included in the first evaluation data set into the first detection model; or a first threshold determination means for determining a first threshold with reference to a comparison result with one or more correct labels attached to each of the plurality of images; or comparing one or more inference results obtained by inputting each of a plurality of images into the second detection model and one or more correct labels attached to each of the one or more images; second threshold determination means for determining a second threshold with reference to the one or more images included in the second data set, by inputting each of the one or more images into the first detection model; first inference means for obtaining one or more inference results for each of a plurality of images; and inputting each of the one or more images included in the first data set into the second detection model.
- An information processing apparatus includes acquisition means for acquiring a target image, and detection means for detecting an object included in the target image using a target image detection model, the target
- the image detection model includes a learning process for learning the detection model using the first data set, and one or more images obtained by inputting each of one or more images included in the evaluation data set into the detection model.
- a data set generation process for generating a data set after the pseudo-labeling by setting an inference result having a reliability equal to or higher than a threshold as a pseudo-label and associating the pseudo-label with the corresponding image, and after the pseudo-labeling is learned by a pseudo-label reference learning process for learning the target image detection model by referring to the data set of .
- An information processing method includes a learning step of learning a detection model using a first data set; a threshold determination step of determining a threshold with reference to a comparison result between one or more inference results obtained by the above and one or more correct labels attached to each of the one or more images; an inference step of obtaining one or more inference results for each of the one or more images included in a data set by inputting each of the one or more images into the detection model; Or, among a plurality of inference results, an inference result having a reliability equal to or higher than the threshold is set as a pseudo-label, and the pseudo-label is associated with the corresponding image, thereby generating a dataset after pseudo-labeling. and a generating step.
- An information processing method includes obtaining a target image, and detecting an object included in the target image using a target image detection model.
- the detection model includes a learning process for learning the detection model using the first data set, and one or more inferences obtained by inputting each of one or more images included in the evaluation data set into the detection model.
- a threshold determination process for determining a threshold value by referring to a comparison result between the result and one or more correct labels attached to each of the one or more images, and one or more images included in the second data set.
- a detection model manufacturing method includes a learning step of learning a detection model using a first data set; A threshold determination step of determining a threshold by referring to a comparison result between one or more inference results obtained by inputting to and one or more correct labels attached to each of the one or more images; an inference step of obtaining one or more inference results for each of the one or more images by inputting each of the one or more images included in the two data sets into the detection model; setting an inference result having a confidence level equal to or higher than the threshold among one or more inference results by setting a pseudo-label, and associating the pseudo-label with the corresponding image to generate a data set after pseudo-labeling and a pseudo label reference learning step of learning a target image detection model for detecting an object included in the target image, using the data set to which the pseudo labels have been assigned.
- a program according to an aspect of the present invention is a program for causing a computer to function as an information processing apparatus, wherein the computer acquires a target image using an acquisition unit for acquiring a target image and a target image detection model. and the detection means for detecting an object included in the target image detection model is a learning process for learning the detection model using the first data set, the evaluation data set 1 or A threshold with reference to a comparison result between one or more inference results obtained by inputting each of a plurality of images into the detection model and one or more correct labels attached to each of the one or more images and inputting each of the one or more images contained in the second data set into the detection model to obtain one or more inference results for each of the one or more images.
- an inference process setting an inference result having a reliability equal to or higher than the threshold among one or more inference results from the inference process as a pseudo-label, and associating the pseudo-label with the corresponding image to assign the pseudo-label It is learned by a data set generation process for generating a later data set and a pseudo label reference learning process for learning the detection model for the target image by referring to the data set after the pseudo labeling.
- FIG. 1 is a block diagram showing the configuration of an information processing device according to exemplary Embodiment 1 of the present invention
- FIG. 2 is a flow chart showing the flow of an information processing method executed by the information processing apparatus shown in FIG. 1
- FIG. 1 is a block diagram showing the configuration of an information processing device according to exemplary Embodiment 1 of the present invention
- FIG. 4 is a flow chart showing the flow of an information processing method executed by the information processing apparatus shown in FIG. 3
- FIG. 7 is a block diagram showing the configuration of an information processing apparatus according to exemplary Embodiment 2 of the present invention
- FIG. 10 is a diagram showing specific examples of data contained in the first data set and the second data set according to exemplary embodiment 3 of the present invention
- FIG. 11 shows an example of data contained in a second dataset and a pseudo-labeled dataset generated from the second dataset according to illustrative embodiment 3 of the present invention
- FIG. 11 is a block diagram showing the configuration of an information processing apparatus according to exemplary Embodiment 4 of the present invention
- FIG. 10 is a diagram showing specific examples of data contained in the first data set and the second data set according to illustrative embodiment 4 of the present invention
- FIG. 10 is a diagram showing an example of data contained in a pseudo-labeled data set according to illustrative embodiment 4 of the present invention
- 1 is a block diagram showing an example of a hardware configuration of an information processing device in each exemplary embodiment of the present invention
- FIG. 10 is a diagram showing an example of data contained in a pseudo-labeled data set according to illustrative embodiment 4 of the present invention
- 1 is a block diagram showing an example of a hardware configuration of an information processing device in each exemplary embodiment of the present invention
- the information processing device 10 has a function as a dataset generation device that generates a dataset after adding a pseudo-label by attaching a pseudo-label to a target dataset.
- the information processing device 10 first learns the detection model using the first data set. Further, the information processing apparatus 10 includes one or more inference results obtained by inputting each of one or more images included in the evaluation data set into the detection model, and one or more inference results attached to each of the images. The threshold is determined by referring to the result of comparison with the correct label. Furthermore, the information processing apparatus 10 obtains one or more inference results for each of the one or more images included in the second data set by inputting each of the one or more images into the detection model.
- the information processing apparatus 10 sets an inference result having a reliability equal to or higher than a threshold among one or more inference results from one or more images included in the second data set as a pseudo-label, and sets the pseudo-label to the corresponding images to generate a pseudo-labeled dataset.
- FIG. 1 is a block diagram showing the configuration of an information processing device 10. As shown in FIG.
- the information processing device 10 includes a learning unit 101, a threshold determination unit 102, an inference unit 103, and a dataset generation unit 104.
- the learning unit 101 is a configuration that implements learning means in this exemplary embodiment.
- the threshold determination unit 102 is a configuration that implements threshold determination means in this exemplary embodiment.
- the inference unit 103 is a configuration that realizes inference means in this exemplary embodiment.
- the data set generation unit 104 is a configuration that implements data set generation means in this exemplary embodiment.
- a learning unit 101 learns a detection model using the first data set. Specifically, the learning unit 101 uses a first data set including one or more images to learn a detection model for detecting objects included in the images. Detection means that by inputting an image into the detection model, - Presence or absence of an object included in the image - Position of the object included in the image - Size of the object included in the image - Output of inference results regarding at least one of the category of the object included in the image. The learning unit 101 learns a detection model that receives an image as an input and outputs an inference result as described above.
- the threshold determination unit 102 determines one or more inference results obtained by inputting each of one or more images included in the evaluation data set into the detection model, and one or more correct answers given to each of the images.
- a threshold is determined by referring to the comparison result with the label.
- the correct label is, for one or more objects included in each of one or more images included in the evaluation data set, It is a label containing ground truth data regarding at least one of the position of an object included in the image, the size of the object included in the image, and the category of the object included in the image.
- the inference unit 103 obtains one or more inference results for each of the one or more images included in the second data set by inputting each of the one or more images into the detection model described above.
- the second data set includes one or more images different from the first data set.
- the data set generation unit 104 sets an inference result having a reliability equal to or higher than a threshold among one or more inference results by the inference unit 103 as a pseudo label, associates the pseudo label with the corresponding image, and creates a pseudo label. Generate a labeled dataset.
- the pseudo label is, for each of one or more images included in the second data set,
- the label includes data on at least one of the position of each of one or more objects inferred to be objects by the inference unit 103, the size of each object, and the category of each object.
- the pseudo label assigned to the object may or may not match the correct label.
- any one or more items of the position, size, and category of the object contained in the correct data about the object match the position, size, and category of the object in the pseudo label, and the other items match It may happen that you do not.
- adjusting the thresholds generally can require time and computational costs.
- the threshold for determining whether the inference result for each of the images included in the second data set is to be the pseudo label is automatically set.
- a configuration determined by is adopted. Therefore, according to the information processing apparatus 10 according to the exemplary embodiment, it is possible to reduce the cost for adjusting the threshold. Therefore, according to the information processing apparatus 10 according to this exemplary embodiment, it is possible to generate a highly accurate detection model while suppressing the generation cost.
- FIG. 2 is a flow diagram showing the flow of the information processing method S10.
- the information processing device 10 performs the information processing method S10 to generate a second data set including images with associated pseudo-labels.
- the information processing method S10 includes steps S101 to S104.
- Step S101 the learning unit 101 learns a detection model. Specifically, the learning unit 101 learns the detection model using the first data set. Step S101 is a learning step in this exemplary embodiment.
- Step S102 the threshold determining unit 102 determines a threshold. Specifically, the threshold determination unit 102 inputs one or more images included in the evaluation data set to the detection model to obtain one or more inference results, and the A threshold for determining a pseudo label is determined by referring to the result of comparison with one or more correct labels. Step S102 is the threshold determination step in this exemplary embodiment.
- Step S103 the inference unit 103 makes an inference. Specifically, the inference unit 103 obtains one or more inference results for each of the one or more images included in the second data set by inputting each of the one or more images into the detection model. Step S103 is an inference step in this exemplary embodiment.
- Step S104 the data set generation unit 104 generates a data set after adding pseudo labels. Specifically, the dataset generation unit 104 sets an inference result having a reliability greater than or equal to a threshold among the one or more inference results in step S103 as a pseudo label, and assigns the pseudo label to the second dataset. Generate a pseudo-labeled dataset by associating it with the corresponding image in . Step S104 is the data set generation step in this exemplary embodiment.
- step S103 is not limited to after execution of step S102.
- the execution timing may be after execution of step S101 and before execution of step S104, for example, before execution of step S102.
- the same effects as those of the information processing apparatus 10 can be obtained. That is, in the information processing method S10 according to this exemplary embodiment, the threshold value for determining whether the inference result for each of the images included in the second data set is to be the pseudo label is automatically determined. configuration is adopted. Therefore, according to the information processing method S10 according to the exemplary embodiment, it is possible to reduce the cost for adjusting the threshold. Therefore, according to the information processing method S10 according to this exemplary embodiment, it is possible to generate a highly accurate detection model while suppressing the generation cost.
- the information processing device 20 acquires a target image and uses the target image detection model to detect an object included in the image.
- the target image detection model is a re-learning of the detection model learned by the information processing apparatus 10 described above, specifically, the learning unit 101. This is a detection model that has been relearned with reference to the dataset.
- the target image detection model is not limited to this.
- the detection model for the target image may be a detection model trained using the dataset after the pseudo-labeling, for example, a new detection model trained using the dataset after the pseudo-labeling. good too.
- the new detection model is a detection model different from the detection model learned by the learning unit 101 .
- FIG. 3 is a block diagram showing the configuration of the information processing device 20. As shown in FIG.
- the information processing device 20 includes an acquisition unit 201 and a detection unit 202 .
- the acquisition unit 201 is a configuration that implements acquisition means in this exemplary embodiment.
- the detection unit 202 is a configuration that realizes detection means in this exemplary embodiment.
- the acquisition unit 201 acquires the target image.
- the target image is an image input to the detection model in order to detect an object included in the image.
- the acquisition unit 201 may acquire the target image by reading the target image stored in the information processing device 20, or may acquire the target image supplied from the imaging device.
- the acquisition unit 201 may acquire the target image via an input device (not shown).
- the acquiring unit 201 may acquire the target image from another device (not shown) communicably connected to the information processing device 20 .
- the detection unit 202 detects an object included in the target image using the target image detection model.
- the target image detection model is a detection model used to detect an object included in the target image, and the target image detection model according to the present exemplary embodiment is the above-described relearned detection model. .
- the detection unit 202 acquires an inference result output from the target image detection model by inputting the target image into the target image detection model.
- the detection unit 202 holds a target image detection model, and inputs the target image to the target image detection model.
- the detection unit 202 accesses a target image detection model stored in a storage device (not shown) and inputs a target image.
- a pseudo-label is determined using an automatically determined threshold, and a data set including images associated with the pseudo-label is used for learning.
- a configuration is adopted in which an object is detected using the target image detection model for which the above is performed. Therefore, according to the information processing apparatus 20 according to the present exemplary embodiment, an effect is obtained that an object included in an image can be detected using a target image detection model in which the cost for adjusting the threshold value is reduced. be done.
- FIG. 4 is a flow diagram showing the flow of the information processing method S20.
- the information processing device 20 executes the information processing method S20 in order to detect an object included in the target image.
- the information processing method S20 includes steps S201 and S202.
- Step S201 the acquisition unit 201 acquires a target image.
- Step S202 the detection unit 202 detects an object. Specifically, the detection unit 202 detects an object included in the target image using the target image detection model. More specifically, the detection unit 202 inputs the target image acquired by the acquisition unit 201 to the target image detection model, and acquires the inference result output by the detection model.
- the same effects as those of the information processing apparatus 20 can be obtained. That is, in the information processing method S20 according to the present exemplary embodiment, pseudo labels are determined using an automatically determined threshold, and learning is performed using a data set including images associated with the pseudo labels. A configuration is adopted in which an object is detected using a target image detection model. For this reason, according to the information processing method S20 according to the present exemplary embodiment, it is possible to detect an object included in an image using a target image detection model in which the cost for adjusting the threshold value is reduced. be done.
- the information processing apparatus 10a is a modification of the first exemplary embodiment. Specifically, the information processing device 10a acquires the first data set, and performs the learning of the detection model, the determination of the threshold value, the inference, and the creation of the data set after pseudo-labeling as described in the first exemplary embodiment. . Further, the information processing apparatus 10a learns the detection model for the target image using the generated pseudo-labeled data set. Typically, the detection model for the target image is a detection model that is re-learned with respect to the above-mentioned detection model, and is a detection model that has been re-learned with reference to the dataset after the pseudo-labeling. As described above, the target image detection model is not limited to a re-learned detection model, and may be a detection model learned using a data set after pseudo-labeling.
- FIG. 5 is a block diagram showing the configuration of the information processing device 10a.
- the information processing device 10a includes a control unit 100a and a storage unit 150a.
- the control unit 100a centrally controls each unit of the information processing device 10a.
- the storage unit 150a stores various programs and data used by the information processing apparatus 10a.
- the storage unit 150a stores an evaluation data set DSE, a data set 1 (DS1), a data set 2 (DS2), a data set 2' (DS2'), and an object detection model DM.
- the evaluation data set DSE is the evaluation data set in this exemplary embodiment.
- Data set 1 (DS1) is the first data set in this exemplary embodiment.
- Data set 2 (DS2) is the second data set in this exemplary embodiment.
- Data set 2' (DS2') is the pseudo-labeled data set in this exemplary embodiment.
- FIG. 6 is a diagram showing a specific example of data included in data set 1 (DS1) and data set 2 (DS2). Specifically, FIG. 6 shows one of the images contained in dataset 1 (DS1) and one of the images contained in dataset 2 (DS2).
- each of these images contains five objects, specifically three people and two bags.
- each of the five objects is associated with a correct label.
- the correct label is a label containing categories and bounding boxes as shown in FIG.
- the category is category information indicating the category of the object included in the image associated with the correct label, and specifically, correct data regarding the category of the object.
- each of the three persons is associated with a "person" category
- each of the two bags is associated with a "bag” category.
- the bounding box is area information indicating the area of the object included in the image associated with the correct label, and specifically, correct data regarding the position and size of the object included in the image.
- One bounding box is associated with one object, and a typical example of the bounding box is data indicating the minimum rectangle enclosing the object, as shown in FIG.
- the “image”, “dataset” and “correct label” described in each exemplary embodiment can be expressed as follows.
- the image x input to the sensing model is an element of the data space X;
- the data space X corresponds to the data set containing the image x.
- the number of objects included in one image x is arbitrary.
- a correct label can be represented by a pair (y,b) of category y and bounding box b.
- category y is an element of category set Y, and in the example of FIG. 6, set Y is "person and bag".
- the data set D associated with the correct label is the image x and the set of all objects included in the image x group with as a set of can be expressed as
- control unit 100a includes a learning unit 101, a threshold value determination unit 102, an inference unit 103, a data set generation unit 104, and a relearning unit 105.
- the threshold determination unit 102 includes an evaluation data set inference unit 1021, an evaluation value calculation unit 1022, and a threshold determination unit 1023.
- the dataset generator 104 includes a pseudo-label generator 1041 and an association unit 1042, as shown in FIG.
- the evaluation data set inference unit 1021, the evaluation value calculation unit 1022, and the threshold determination unit 1023 correspond to the threshold determination unit 102 in exemplary embodiment 1, and are configured to implement the threshold determination means in this exemplary embodiment.
- the pseudo-label generating unit 1041 and the associating unit 1042 correspond to the dataset generating unit 104 in exemplary embodiment 1, and are configured to implement the dataset generating means in this exemplary embodiment.
- the re-learning unit 105 is a configuration that implements pseudo-label reference learning means in this exemplary embodiment.
- the learning unit 101 acquires the data set 1 (DS1), and uses the data set 1 (DS1) to learn an object detection model for pseudo label generation. That is, the learning unit 101 also functions as an acquisition unit that acquires the first data set. Specifically, the learning unit 101 reads the data set 1 (DS1) stored in the storage unit 150a, and associates a correct label with each of the data set 1 (DS1), that is, one or more images. We train an object detection model for generating pseudo-labels using the data set. Then, the learning unit 101 outputs the trained pseudo-label generation object detection model to the evaluation data set inference unit 1021 and the inference unit 103 .
- the evaluation data set inference unit 1021 generates an inference result based on the evaluation data set. Specifically, the evaluation data set inference unit 1021 acquires the evaluation data set DSE and the pseudo label generation object detection model, and converts each of the one or more images included in the evaluation data set DSE into pseudo label generation. input to the object detection model for use and obtain inference results. More specifically, the evaluation data set inference unit 1021 reads the evaluation data set DSE stored in the storage unit 150 a and inputs it to the pseudo label generation object detection model acquired from the learning unit 101 . Then, the evaluation data set inference unit 1021 acquires the inference result output by the object detection model for pseudo label generation, and outputs the inference result to the evaluation value calculation unit 1022 .
- the evaluation data set DSE is a data set in which a correct label is associated with each object included in each image, similar to the data set 1 (DS1).
- the images contained in the evaluation data set DSE may be part of the images contained in the data set 1 (DS1).
- the images included in the evaluation data set DSE may be generated by giving correct labels to part of the images included in the data set 2 (DS2).
- the images included in the evaluation data set DSE may be generated by assigning correct labels to images not included in the data set 1 (DS1) and the data set 2 (DS2). .
- the inference result by the evaluation data set inference unit 1021 is, for each of one or more images included in the evaluation data set DSE, position of each of the one or more objects inferred to be an object, size of each of the objects, and/or category of each of the objects; Contains data.
- the inference result by the evaluation data set inference unit 1021 includes category, bounding box and confidence.
- the reliability is an example of data related to the certainty of inference, and is a numerical value with 0 as the minimum value and 1 as the maximum value, for example.
- the evaluation value calculation unit 1022 calculates an evaluation value based on the inference result. Specifically, the evaluation value calculation unit 1022 compares each inference result for each of the one or more images included in the evaluation data set DSE with the correct label for each image, based on each inference result Calculate the evaluation value of
- the evaluation value is the harmonic average of the precision and recall, that is, the F value.
- F value calculation processing executed by the evaluation value calculation unit 1022 will be described.
- the evaluation value calculation unit 1022 executes the following processes (1) to (6).
- the reference value is, for example, 0.9.
- a plurality of F values are calculated in the F value calculation process. Then, the reference value becomes a different value for each F value. That is, the value 0.9 described above can be expressed as the initial value of the reference value.
- TP is an inference result in which the degree of overlap between the bounding box and the bounding box of the correct label is equal to or greater than a predetermined value and the category matches the correct label.
- FP is (A) An inference result where the category matches the correct label, but the degree of overlap between the bounding box and the bounding box of the correct label is less than or equal to a predetermined value (B) An inference result where the correct label where the bounding box overlaps and the category are different (C) Any inference result in which there is no correct label that overlaps the bounding box.
- IOU Intersection Over Union
- FN false negative
- D A correct label that does not have an inference result whose bounding box overlaps
- E A correct label whose category is different from the inference result whose bounding box overlaps.
- FIG. 7 is a graph showing the relationship between precision and recall. As shown in FIG. 7, the higher the reliability, the higher the precision, but the lower the recall. On the other hand, the lower the reliability, the lower the precision, but the higher the recall. In this way, there is a trade-off relationship between precision and recall.
- the reliability is the reference value set in the process (2).
- the evaluation value calculation unit 1022 decreases the reference value and executes the processing of (2) to (6) again. For example, the evaluation value calculation unit 1022 sets the next reference value to 0.8. In other words, the evaluation value calculator 1022 calculates the F value based on the following reference values. The evaluation value calculation unit 1022 repeats the processes (2) to (6) to calculate the F value based on each reference value. Thereby, a plurality of F values are calculated based on each of the different reference values.
- the evaluation value calculation unit 1022 repeats the processes (2) to (6) until the F value is calculated with a reference value equal to or lower than the minimum reliability.
- the F value is calculated for all inference results.
- the processes (3) and (4) are omitted, and the past You may use the specific result in the process of (3) and (4).
- the evaluation value calculation unit 1022 associates each calculated evaluation value, that is, the F value with the reference value used in calculating each F value, and outputs it to the threshold determination unit 1023 .
- the evaluation value calculation unit 1022 may calculate the precision and the recall for each category when there are multiple categories in the inference result and the correct label.
- the evaluation value calculation unit 1022 calculates the F value for each category. As a result, each reference value is associated with a plurality of F values calculated for each category.
- the evaluation value calculated by the evaluation value calculation unit 1022 is not limited to the F value.
- the evaluation value may be a value that emphasizes precision or recall.
- the evaluation value calculation unit 1022 calculates the evaluation value by, for example, ⁇ (1+ ⁇ 2 ) ⁇ relevance ⁇ recall ⁇ / ⁇ ( ⁇ 2 ⁇ relevance)+recall). can be calculated.
- ⁇ is a value for adjusting the degree of importance of precision to recall. , it becomes an evaluation value that emphasizes the relevance rate.
- the method of specifying at least part of the inference result in the process (2) when calculating a plurality of evaluation values is not limited to the above example.
- the evaluation value calculation unit 1022 may identify a predetermined number of inference results in descending order of reliability. In this example, the evaluation value calculation unit 1022 increases the predetermined number by a predetermined number each time the process (6) is completed and the next processes (2) to (6) are performed. Then, the evaluation value calculation unit 1022 repeats the processes (2) to (6) until all the inference results are specified by the process (2) and the evaluation values are calculated. It should be noted that the amount of increase in the predetermined number in the last process (2) should be 1 or more and the predetermined number or less. In this example, each calculated evaluation value is associated with the minimum reliability among the reliability of the specified inference result and output to the threshold determination unit 1023 .
- the evaluation value calculation unit 1022 For every inference result, specify TP, FP and FN. - A plurality of thresholds are set for the reliability, and the number of TPs with reliability equal to or higher than each threshold is specified. • Calculate precision and recall for each of the specified numbers of TPs. - An evaluation value (typical example: F value) is calculated for each of the calculated multiple combinations of precision and recall. and processing may be performed. Note that the number of specified TPs is proportional to the value of the recall. In this example, each calculated evaluation value is associated with the threshold value used to specify the number of TPs, and is output to the threshold determination unit 1023 .
- the threshold determination unit 1023 determines the threshold based on the evaluation value. Specifically, the threshold determination unit 1023 identifies the maximum value among the acquired F values, and sets the reference value associated with the identified F value as the threshold.
- the maximum value of the F values can be expressed as a value that balances the precision and the recall.
- the F value is calculated by a formula including the precision and the recall. Therefore, the threshold determination unit 1023 determines the threshold by referring to the precision and the recall indicated by the comparison result of the evaluation value calculation unit 1022. Then it can be expressed.
- the precision and recall at which the F value is the maximum value is the maximum precision or recall in the graph of FIG. It is not a point, but a point indicated by a star in the graph in FIG. 7, for example.
- Threshold determination section 1023 outputs the determined threshold to pseudo-label generation section 1041 .
- the threshold determination unit 1023 sets the threshold for each category. That is, the threshold determination unit 1023 determines a plurality of thresholds for each category, associates the plurality of thresholds with information indicating the corresponding category, and outputs the information to the pseudo-label generation unit 1041 .
- the inference unit 103 reads the data set 2 (DS2) stored in the storage unit 150a, and adds one or a plurality of , and obtain one or more inference results for each of the images.
- the inference unit 103 outputs the acquired inference result to the pseudo-label generation unit 1041 .
- the pseudo-label generation unit 1041 generates pseudo-labels. Specifically, the pseudo-label generation unit 1041 sets an inference result having reliability equal to or higher than the threshold determined by the threshold determination unit 1023 among one or more inference results by the inference unit 103 as a pseudo-label. The pseudo-label generation unit 1041 outputs the inference result set in the pseudo-label to the association unit 1042 .
- the pseudo-label generation unit 1041 selects one or more inference results from the inference unit 103 that have a reliability equal to or higher than the threshold set for each category. Set the result to a pseudo-label. Specifically, the pseudo-label generation unit 1041 classifies the inference result by the inference unit 103 for each category, and specifies a corresponding threshold for each classification, in other words, a threshold that matches the category. Then, the pseudo-label generation unit 1041 compares the reliability of each inference result with the specified threshold for each classification, and sets an inference result having a reliability equal to or higher than the threshold as a pseudo-label.
- the association unit 1042 associates the pseudo label set by the pseudo label generation unit 1041 with the corresponding image. This produces a dataset 2' (DS2') in which each of the one or more images contained in the dataset 2 (DS2) is associated with a pseudo-label.
- the association unit 1042 stores the generated data set 2 ′ (DS2′) in the storage unit 150 a and notifies the relearning unit 105 of it.
- FIG. 8 is a diagram showing a specific example of data included in data set 2' (DS2'). Specifically, FIG. 8 shows one of the images contained in dataset 2' (DS2'). The image is an image included in data set 2 (DS2) shown in FIG. 6, and a pseudo label is associated with each of the five objects included in the image.
- pseudo-labels are labels that include categories and bounding boxes as shown in FIG.
- the category is category information indicating the category of objects included in the image associated with the pseudo label.
- each of the three persons is associated with a "person" category
- each of the two bags is associated with a "bag” category.
- a bounding box is area information indicating the area of an object contained in an image associated with a pseudo label.
- One bounding box is associated with one object, and a typical example of the bounding box is data indicating the minimum rectangle enclosing the object, as shown in FIG.
- the re-learning unit 105 learns the detection model for the target image using the data set after the pseudo-labeling. As an example, the re-learning unit 105 re-learns the detection model learned by the learning unit 101 as learning of the target image detection model. Specifically, the relearning unit 105 reads the data set 2' (DS2') from the storage unit 150a, and uses the data set 2' (DS2') to learn the object detection model DM. Then, the relearning unit 105 stores the learned object detection model DM in the storage unit 150a. As another example, the relearning unit 105 may learn a new detection model as learning of the target image detection model, and store the new detection model in the storage unit 150a.
- the information processing apparatus 10a according to the present exemplary embodiment adopts a configuration in which the target image detection model is learned using the data set to which pseudo labels have been assigned. Therefore, according to the information processing apparatus 10a according to the exemplary embodiment, it is possible to reduce the cost of adjusting the threshold value and generate the target image detection model. Therefore, according to the information processing apparatus 10a according to the exemplary embodiment, it is possible to generate a highly accurate target image detection model while suppressing the generation cost.
- the information processing apparatus 10a in the information processing apparatus 10a according to this exemplary embodiment, a configuration is adopted in which the detection model learned by the learning unit 101 is re-learned as the learning of the target image detection model. Therefore, according to the information processing apparatus 10a according to the present exemplary embodiment, it is possible to reduce the cost of re-learning and improve the accuracy of the detection model.
- the threshold for determining whether the inference result for each of the images included in the second data set is to be the pseudo label is automatically determined. configuration is adopted. Therefore, according to the information processing apparatus 10a according to the present exemplary embodiment, it is possible to reduce the number of times of re-learning required each time the threshold is adjusted to one. As a result, the time required for re-learning can be reduced, and the time required for generation of the detection model can be reduced.
- the correct label and the pseudo label include area information and category information. Therefore, according to the information processing apparatus 10a according to the exemplary embodiment, it is possible to improve the accuracy of detecting an object included in an image using a re-learned detection model.
- the information processing apparatus 10a according to the exemplary embodiment adopts a configuration in which the threshold is determined by referring to the calculated relevance rate and recall rate. Therefore, according to the information processing apparatus 10a according to the exemplary embodiment, it is possible to improve the accuracy of pseudo label setting. Further, according to the information processing apparatus 10a according to the present exemplary embodiment, pseudo labels can be set in consideration of both the quality of learning data (precision rate) and the amount of learning data (recall rate). It is possible to obtain an effect that a highly accurate target image detection model can be generated.
- the inference result of the image included in the evaluation data set and the correct answer associated with the image are obtained by the pseudo label generation object detection model.
- a configuration may be employed in which a threshold is set for each category of labels. Therefore, according to the information processing apparatus 10a according to the present exemplary embodiment, which employs this configuration, it is possible to improve the accuracy of setting pseudo labels.
- a configuration may be adopted in which the images included in the evaluation data set DSE are included in the first data set. Therefore, according to the information processing apparatus 10a according to the present exemplary embodiment, which employs this configuration, there is no need to newly perform a high-cost correct answer assignment task in order to generate the evaluation data set DSE. effect is obtained. Further, according to the information processing apparatus 10a according to the present exemplary embodiment, which employs the configuration, it is possible to reduce the number of images to be prepared in advance.
- images included in the evaluation data set DSE are generated by giving correct labels to part of the second data set.
- a configuration may be employed. Therefore, according to the information processing apparatus 10a according to the present exemplary embodiment that employs this configuration, a part of the data set to which the pseudo label is assigned is used as the evaluation data set DSE to determine the threshold value. Therefore, it is possible to obtain an effect that the accuracy of the assigned pseudo-label can be improved. Further, according to the information processing apparatus 10a according to this exemplary embodiment, which employs the configuration, it is possible to reduce the number of images to be prepared in advance.
- FIG. 9 is a flowchart showing the flow of the information processing method S10a.
- the information processing device 10a performs the information processing method S10a to generate a second data set including images with associated pseudo-labels.
- Step S101 the learning unit 101 learns a detection model. Specifically, the learning unit 101 reads the data set 1 (DS1) stored in the storage unit 150a, and associates a correct label with each of the data set 1 (DS1), that is, one or more images. We train an object detection model for generating pseudo-labels using the data set. Then, the learning unit 101 outputs the trained pseudo-label generation object detection model to the evaluation data set inference unit 1021 and the inference unit 103 .
- step S1021 the evaluation data set inference unit 1021 generates an inference result based on the evaluation data set. Specifically, the evaluation data set inference unit 1021 reads the evaluation data set DSE stored in the storage unit 150 a and inputs it to the pseudo label generation object detection model acquired from the learning unit 101 . Then, the evaluation data set inference unit 1021 acquires the inference result output by the object detection model for pseudo label generation, and outputs the inference result to the evaluation value calculation unit 1022 .
- step S1022 the evaluation value calculator 1022 calculates an evaluation value based on the inference result. Specifically, the evaluation value calculation unit 1022 calculates the inference result specified based on the reference value among the inference results by the evaluation data set inference unit 1021 and each of one or a plurality of images included in the evaluation data set DSE. A precision rate and a recall rate are calculated based on the comparison result with the correct label in , and an F value as an evaluation value is calculated from the precision rate and the recall rate. The evaluation value calculation unit 1022 repeats calculation of the F value by changing the reference value, and calculates a plurality of F values corresponding to each reference value. The evaluation value calculation unit 1022 associates each of the calculated F values with the corresponding reference value and outputs them to the threshold determination unit 1023 .
- step S1023 the threshold determination unit 1023 determines a threshold based on the evaluation value. Specifically, the threshold determination unit 1023 identifies the maximum value among the acquired F values, and sets the reference value associated with the identified F value as the threshold. Threshold determination section 1023 outputs the determined threshold to pseudo-label generation section 1041 .
- steps S1021 to S1023 correspond to step S102 described in the first exemplary embodiment.
- Step S103 the inference unit 103 makes an inference. Specifically, the inference unit 103 reads the data set 2 (DS2) stored in the storage unit 150a, and adds the pseudo label generation object detection model acquired from the learning unit 101 to the data set 2 (DS2) Input each of the one or more images included and obtain one or more inference results for each of the images. The inference unit 103 outputs the acquired inference result to the pseudo-label generation unit 1041 .
- step S1041 the pseudo-label generation unit 1041 generates pseudo-labels. Specifically, the pseudo-label generation unit 1041 sets an inference result having reliability equal to or higher than the threshold determined by the threshold determination unit 1023 among one or more inference results by the inference unit 103 as a pseudo-label. The pseudo-label generation unit 1041 outputs the inference result set in the pseudo-label to the association unit 1042 .
- Step S1042 the associating unit 1042 associates the image with the pseudo label. Specifically, the associating unit 1042 associates each of the one or more images included in dataset 2 (DS2) with a corresponding pseudo label to generate dataset 2′ (DS2′).
- the pseudo-label generation unit 1041 stores the data set 2′ (DS2′) generated by the association unit 1042 in the storage unit 150a and notifies the relearning unit 105 of it.
- steps S1041 and S1042 correspond to step S104 described in the first exemplary embodiment.
- the re-learning unit 105 learns the target image detection model using the data set after the pseudo-labeling.
- the relearning unit 105 performs relearning of the detection model learned by the learning unit 101 as the learning. Specifically, the relearning unit 105 reads the data set 2' (DS2') from the storage unit 150a, and uses the data set 2' (DS2') to learn the object detection model DM. Then, the relearning unit 105 stores the learned object detection model DM in the storage unit 150a.
- the relearning unit 105 may learn a new detection model as learning of the target image detection model, and store the new detection model in the storage unit 150a.
- the same effects as those of the information processing apparatus 10a can be obtained. That is, in the information processing method S10a according to the present exemplary embodiment, a configuration is adopted in which the target image detection model is learned using the data set to which pseudo labels have been added. Therefore, according to the information processing method S10a according to the present exemplary embodiment, it is possible to reduce the cost of adjusting the threshold value and generate the target image detection model used by the information processing apparatus. . Therefore, according to the information processing method S10a according to this exemplary embodiment, it is possible to generate a highly accurate detection model while suppressing the generation cost.
- FIG. 10 is a block diagram showing the configuration of the information processing device 20a.
- the information processing device 20a includes a control section 200a, a storage section 250a and an output section 260a.
- the control unit 200a centrally controls each unit of the information processing device 20a.
- the storage unit 250a stores various programs and data used by the information processing device 20a.
- the output unit 260a outputs information processing results by the information processing device 20a.
- the storage unit 250a stores the target data set TDS and the object detection model DM.
- the target data set TDS is a data set containing one or more target images that are object detection targets.
- the object detection model DM is a target image detection model, specifically, an object detection model DM generated by the re-learning unit 105 of the information processing apparatus 10a.
- the object detection model DM is - a learning process for learning a detection model using the first data set; ⁇ One or more inference results obtained by inputting each of one or more images included in the evaluation data set into the detection model, and one or more correct answers given to each of the one or more images threshold determination processing for determining a threshold with reference to the comparison result with the label; an inference process for obtaining one or more inference results for each of the one or more images contained in the second data set by inputting each of the one or more images into the detection model; - Of the one or more inference results from the inference process, an inference result having a reliability equal to or higher than a threshold is set as a pseudo-label, and the pseudo-label is associated with the corresponding image to create a data set after pseudo-labeling and a pseudo-label reference learning process for re-learning the detection model for the target image by referring to the data set after giving the pseudo-label.
- the object detection model DM is manufactured by a method including the
- control unit 200a includes an acquisition unit 201 and a detection unit 202.
- FIG. 10 (Configuration of control unit 200a) As shown in FIG. 10, the control unit 200a includes an acquisition unit 201 and a detection unit 202.
- FIG. 10 (Configuration of control unit 200a) As shown in FIG. 10, the control unit 200a includes an acquisition unit 201 and a detection unit 202.
- FIG. 10 (Configuration of control unit 200a) As shown in FIG. 10, the control unit 200a includes an acquisition unit 201 and a detection unit 202.
- the acquisition unit 201 acquires the target image. Specifically, the acquisition unit 201 reads the target data set TDS from the storage unit 250 a and outputs it to the detection unit 202 .
- the detection unit 202 detects an object included in the target image using the target image detection model. Specifically, the detection unit 202 inputs the target image included in the target data set TDS acquired from the acquisition unit 201 to the object detection model DM, and acquires the inference result output from the object detection model DM. The detection unit 202 outputs the obtained inference result to the output unit 260a. As a result, the output unit 260a, for each target image, At least one of the presence/absence of an object included in the target image, the position of the object included in the target image, the size of the object included in the target image, and the category of the object included in the target image is output.
- the output unit 260a causes the display device to display the target image in which at least a part of the object is assigned a category and a bounding box.
- the display device may be the output unit 260a, or may be a display device (not shown) communicably connected to the information processing device 20a.
- a pseudo label is determined using an automatically determined threshold, and a data set including images associated with the pseudo label is used for learning.
- a configuration is adopted in which an object is detected using the target image detection model for which the above is performed. For this reason, according to the information processing apparatus 20a according to the present exemplary embodiment, it is possible to detect an object included in an image using a target image detection model in which the cost for adjusting the threshold value is reduced. be done.
- the information processing apparatus 20a in the information processing apparatus 20a according to the present exemplary embodiment, a configuration is adopted in which an inference result for the target image is output by the target image detection model. Therefore, according to the information processing device 20a according to the exemplary embodiment, an effect is obtained that the user of the information processing device 20a can recognize the inference result.
- FIG. 11 is a block diagram showing the configuration of the information processing device 10b.
- the information processing device 10b includes a control section 100b and a storage section 150b.
- the control unit 100b centrally controls each unit of the information processing device 10b.
- the storage unit 150b stores various programs and data used by the information processing device 10b.
- the difference between the storage unit 150b and the storage unit 150a described in the second exemplary embodiment is the data included in the data set 2 (DS2). Details of the data will be described with reference to FIG.
- FIG. 12 is a diagram showing a specific example of data included in dataset 1 (DS1) and dataset 2 (DS2). Specifically, FIG. 12 shows one of the images contained in dataset 1 (DS1) and one of the images contained in dataset 2 (DS2).
- the images contained in dataset 1 (DS1) contain five objects, specifically three dogs and two cows.
- the images contained in dataset 2 (DS2) also contain five objects, specifically two dogs and three cows.
- Data set 1 (DS1) and data set 2 (DS2) according to this exemplary embodiment are a plurality of data sets (also called expert data sets) with different categories (responsibility ranges) assigned as correct answers.
- the evaluation data set DSE is a data set including images in which correct labels are associated with dogs, similar to data set 1 (DS1).
- control unit 100b differs from the control unit 100a described in the second exemplary embodiment in that it includes an associating unit 1042b instead of the associating unit 1042.
- the pseudo-label generating unit 1041 and the associating unit 1042b correspond to the dataset generating unit 104 in exemplary embodiment 1, and are configured to implement the dataset generating means in this exemplary embodiment.
- the association unit 1042b has the following functions in addition to the functions of the association unit 1042. That is, when the object included in the image associated with the pseudo label is given a correct label, the associating unit 1042b associates the area indicated by the area information included in the pseudo label with the area included in the correct label. If the degree of overlap with the area indicated by the area information is greater than or equal to a predetermined degree, the pseudo label is deleted.
- FIG. 13 is a diagram showing specific examples of data included in data set 2 (DS2), data set 2′ (DS2′), and data set 2′′. (DS2), one of the images contained in each of dataset 2' (DS2') and dataset 2'' is shown. Note that data set 2 (DS2) according to this exemplary embodiment has already been described and will not be repeated here.
- Data set 2' is a data set in which each of the one or more images included in data set 2 (DS2) is associated with a pseudo-label, as described in exemplary embodiment 2. Since the pseudo-label can be said to be based on the data set 1 (DS1) and the evaluation data set DSE, in the example of FIG. 13, the correct label whose category is "dog" is associated with a part of the object. .
- the object Ob1 is associated with a pseudo-label including the category "dog", that is, an incorrect pseudo-label.
- the correct label is associated with the object Ob1.
- object Ob1 is associated with the correct label in addition to the pseudo label.
- the associating unit 1042b identifies the corresponding image from the images included in the dataset 2 (DS2) for each image included in the dataset 2' (DS2').
- the associating unit 1042b selects one of the images included in the data set 2′ (DS2′), and for each of the bounding boxes of the pseudo labels associated with the image, the correct label included in the identified image. Calculate the IOU with the bounding box of .
- the IOU corresponds to the degree of overlap described above.
- the associating unit 1042b performs this process for all images included in dataset 2' (DS2').
- the associating unit 1042b deletes the pseudo label when there is a correct label with an IOU equal to or greater than a predetermined value.
- the IOU between the pseudo label associated with the object Ob1 and the correct label associated with the object Ob1 is greater than or equal to a predetermined value. Therefore, the associating unit 1042b deletes the pseudo label associated with the object Ob1.
- the image included in the data set 2′′ shown in FIG. 13 is the image after the pseudo label is deleted. As shown in FIG. 13, in the image, the pseudo label associated with the object Ob1 is deleted, and the object Only correct labels are associated with Ob1.
- the pseudo label and the correct label attached to the image in the pseudo label and the correct label attached to the image, the area indicated by the area information included in the pseudo label and the area information included in the correct label is greater than or equal to a predetermined degree, the pseudo label is deleted. Therefore, according to the information processing apparatus 10b according to the present exemplary embodiment, if the pseudo label is not appropriate, the pseudo label is deleted and the correct label remains. The effect of being able to improve accuracy is obtained. Note that the pseudo-label is not appropriate, for example, (1) the category of the pseudo-label is different from the category of the object, (2) the bounding box of the pseudo-label does not enclose part of the object, etc. point to
- the information processing device 10c determines a threshold based on each of the expert datasets, and assigns a pseudo-label to each of the plurality of datasets based on the threshold.
- FIG. 14 is a block diagram showing the configuration of the information processing device 10c.
- the information processing device 10c includes a first control section 100c, a first storage section 150c, a second control section 110c, and a second storage section 160c.
- the first control unit 100c and the second control unit 110c collectively control each unit of the information processing device 10c.
- the first storage unit 150c and the second storage unit 160c store various programs and data used by the information processing device 10c.
- first control unit 100c and the second control unit 110c may be integrated.
- first storage unit 150c and the second storage unit 160c may be integrated.
- second control unit 110c and the second storage unit 160c may be provided in another device communicably connected to the information processing device 10c.
- the first storage unit 150c stores data set 1 (DS1), data set 2 (DS2), evaluation data set 1 (DSE1), and evaluation data set 2 (DSE2).
- Data set 1 (DS1) is the first data set in this exemplary embodiment.
- Data set 2 (DS2) is the second data set in this exemplary embodiment.
- Data set 1 (DS1) and data set 2 (DS2) are the expert data sets described above.
- Evaluation dataset 1 (DSE1) is the first evaluation dataset in this exemplary embodiment.
- Evaluation dataset 2 (DSE2) is the second evaluation dataset in this exemplary embodiment.
- FIG. 15 is a diagram showing a specific example of data included in dataset 1 (DS1) and dataset 2 (DS2). Specifically, FIG. 15 shows one of the images contained in dataset 1 (DS1) and one of the images contained in dataset 2 (DS2).
- Each of these images contains five objects, specifically three people and two bags.
- each of the two bags is associated with a correct label. That is, data set 1 (DS1) is an expert data set whose responsibility is "bag”.
- data set 2 (DS2) is an expert data set whose scope of responsibility is "person”.
- evaluation dataset 1 (DSE1) is a dataset in which a correct label is associated with each of the objects within the scope of responsibility included in each image.
- evaluation data set 1 (DSE1) is a data set that includes images in which correct labels are associated with bags.
- the images contained in evaluation dataset 1 (DSE1) may be part of the images contained in dataset 1 (DS1).
- the images included in the evaluation data set 1 (DSE1) are images that are not included in the data set 1 (DS1), and the correct label is associated with the object of the responsibility range in the data set 1 (DS1). It may be an image that
- dataset 2 for evaluation is a dataset in which a correct label is associated with each of the objects within the scope of responsibility included in each image.
- evaluation data set 2 (DSE2) is a data set containing images in which correct labels are associated with persons.
- the images contained in evaluation dataset 2 (DSE2) may be part of the images contained in dataset 2 (DS2).
- the images included in the evaluation data set 2 (DSE2) are images that are not included in the data set 2 (DS2), and the correct labels are associated with the objects in the scope of responsibility in the data set 2 (DS2). It may be an image that
- the first control unit 100c includes a first learning unit 101-1, a second learning unit 101-2, a first threshold determination unit 102-1, a second threshold determination unit 102 -2, a first inference unit 103-1, a second inference unit 103-2, a first data set generation unit 104-1, and a second data set generation unit 104-2.
- the first learning unit 101-1 is configured to implement the first learning means in this exemplary embodiment.
- the second learning unit 101-2 is configured to implement the second learning means in this exemplary embodiment.
- the first threshold determination unit 102-1 is a configuration that implements the first threshold determination means in this exemplary embodiment.
- the second threshold determination unit 102-2 is a configuration that implements the second threshold determination means in this exemplary embodiment.
- the first inference unit 103-1 is a configuration that implements the first inference means in this exemplary embodiment.
- the second inference unit 103-2 is a configuration that implements the second inference means in this exemplary embodiment.
- the first data set generation unit 104-1 is a configuration that implements the first data set generation means in this exemplary embodiment.
- the second data set generation unit 104-2 is a configuration that implements the second data set generation means in this exemplary embodiment.
- the first learning unit 101-1 uses the first data set to learn the first detection model. Specifically, the first learning unit 101-1 acquires the data set 1 (DS1), and uses the data set 1 (DS1) to learn the first pseudo label generation object detection model PDM1. conduct. More specifically, the first learning unit 101-1 reads the data set 1 (DS1) stored in the first storage unit 150c, and uses the data set 1 (DS1) to perform the first The object detection model PDM1 for pseudo label generation is learned. Then, the first learning unit 101-1 outputs the learned first object detection model PDM1 for pseudo label generation to the first threshold determination unit 102-1 and the first inference unit 103-1.
- the first threshold determination unit 102-1 inputs one or more inference results obtained by inputting each of one or more images included in the first evaluation data set into the first detection model, and the one or more inference results. Alternatively, the first threshold is determined by referring to the result of comparison with one or more correct labels attached to each of the plurality of images.
- the first threshold determination unit 102-1 reads the evaluation data set 1 (DSE1) stored in the first storage unit 150c, and the first threshold value acquired from the first learning unit 101-1 1 to the pseudo label generation object detection model PDM1. Then, the first threshold determination unit 102-1 acquires the inference result output by the first pseudo-label generating object detection model PDM1.
- DSE1 evaluation data set 1
- PDM1 pseudo label generation object detection model
- the first threshold determination unit 102-1 compares each inference result in each of the one or more images included in the evaluation data set 1 (DSE1) with the correct label in each of the images. Based on this, the evaluation value of each inference result is calculated.
- the evaluation value is, for example, the F value. Note that the details of the calculation process of the F value in the example where the evaluation value is the F value have been described in the second exemplary embodiment, and thus the description will not be repeated here.
- the first threshold determination unit 102-1 identifies the maximum value among the plurality of F values calculated for each reference value, and sets the reference value linked to the identified F value as the threshold. This threshold is the above-described first threshold.
- First threshold determination section 102-1 outputs the determined first threshold to first data set generation section 104-1.
- the first inference unit 103-1 inputs each of the one or more images included in the second data set to the first detection model, thereby obtaining one or more images for each of the one or more images.
- Get the inference result of Specifically, the first inference unit 103-1 reads the data set 2 (DS2) stored in the first storage unit 150c, and obtains the first pseudo data obtained from the first learning unit 101-1.
- One or more images included in the data set 2 (DS2) are input to the label generation object detection model PDM1, and one or more inference results PR1 are obtained for each of the images.
- First inference unit 103-1 outputs obtained inference result PR1 to first data set generation unit 104-1.
- First data set generation unit 104-1 sets an inference result having a reliability equal to or higher than a first threshold among one or more inference results by first inference unit 103-1 as a pseudo label, A second post-pseudo-labeled data set is generated by associating the pseudo-labels with the corresponding images. Specifically, the first data set generation unit 104-1 simulates an inference result having a reliability equal to or higher than the first threshold among the one or more inference results PR1 by the first inference unit 103-1. Set to label. The first dataset generator 104-1 then associates the pseudo-label with the corresponding image.
- the first data set generation unit 104-1 stores the generated data set 2' (DS2') in the second storage unit 160c.
- the second learning unit 101-2 learns the second detection model using the second data set. Specifically, the second learning unit 101-2 acquires the data set 2 (DS2), and uses the data set 2 (DS2) to learn the second pseudo label generation object detection model PDM2. conduct. More specifically, the second learning unit 101-2 reads the data set 2 (DS2) stored in the first storage unit 150c, and uses the data set 2 (DS2) to perform the second The object detection model PDM2 for pseudo label generation is learned. Then, the second learning unit 101-2 outputs the learned second object detection model PDM2 for pseudo label generation to the second threshold determination unit 102-2 and the second inference unit 103-2.
- the second threshold determination unit 102-2 inputs one or more inference results obtained by inputting each of one or more images included in the second evaluation data set into the second detection model, and the one or more inference results.
- the second threshold is determined by referring to the result of comparison with one or more correct labels attached to each of the plurality of images.
- the second threshold determination unit 102-2 reads the evaluation data set 2 (DSE2) stored in the first storage unit 150c, and the second threshold value obtained from the second learning unit 101-2 2 to the object detection model PDM2 for pseudo label generation. Then, the second threshold determination unit 102-2 acquires the inference result output by the second pseudo-label generation object detection model PDM2.
- DSE2 evaluation data set 2
- the second threshold determination unit 102-2 compares each inference result in each of the one or more images included in the evaluation data set 2 (DSE2) with the correct label in each of the images. Based on this, the evaluation value of each inference result is calculated.
- the evaluation value is, for example, the F value. Note that the details of the calculation process of the F value in the example where the evaluation value is the F value have been described in the second exemplary embodiment, and thus the description will not be repeated here.
- the second threshold determination unit 102-2 identifies the maximum value among the plurality of F values calculated for each reference value, and sets the reference value linked to the identified F value as the threshold. This threshold is the above-described second threshold. Second threshold determination section 102-2 outputs the determined second threshold to second data set generation section 104-12.
- the second inference unit 103-2 inputs each of the one or more images included in the first data set to the second detection model, thereby obtaining one or more images for each of the one or more images.
- Get the inference result of Specifically, the second inference unit 103-2 reads the data set 1 (DS1) stored in the first storage unit 150c, and obtains the second pseudo data obtained from the second learning unit 101-2.
- One or more images included in the data set 1 (DS1) are input to the label generation object detection model PDM2, and one or more inference results PR2 are obtained for each of the images.
- Second inference unit 103-2 outputs obtained inference result PR2 to second data set generation unit 104-2.
- the second data set generation unit 104-2 sets an inference result having a reliability equal to or higher than a second threshold among the one or more inference results by the second inference unit 103-2 as a pseudo label,
- a first post-pseudo-labeled data set is generated by associating the pseudo-labels with the corresponding images.
- the second data set generation unit 104-2 simulates an inference result having a reliability equal to or higher than the second threshold among the one or more inference results PR2 by the second inference unit 103-2.
- Second data set generator 104-2 then associates the pseudo-label with the corresponding image.
- the second data set generation unit 104-2 stores the generated data set 1' (DS1') in the second storage unit 160c.
- FIG. 16 is a diagram showing specific examples of data included in data set 1' (DS1') and data set 2' (DS2'). Specifically, FIG. 16 shows one of the images contained in data set 1' (DS1') and one of the images contained in data set 2' (DS2').
- the images included in dataset 1' (DS1') shown in FIG. 16 are the same as the images included in dataset 1 (DS1) (see FIG. 15).
- each of the two bags is associated with a correct label and each of the three persons with a pseudo-label.
- the correct label is the correct label associated with the bag that is the responsibility of the dataset 1 (DS1) in the images included in the dataset 1 (DS1) that is the source of the dataset 1′ (DS1′).
- the pseudo-label is a pseudo-label set by the second data set generator 104-2 based on the inference result PR2.
- the inference result PR2 is an inference result using the second pseudo-label generation object detection model PDM2 that has been trained using the data set 2 (DS2) whose responsibility range is a person. Associated.
- the images included in dataset 2' (DS2') shown in FIG. 16 are the same as the images included in dataset 2 (DS2) (see FIG. 15).
- each of the three persons is associated with a correct label and each of the two bags with a pseudo-label.
- the correct label is the correct label associated with the person responsible for the dataset 2 (DS2) in the images included in the dataset 2 (DS2) that is the source of the dataset 2′ (DS2′).
- the pseudo-label is a pseudo-label set by the first data set generator 104-1 based on the inference result PR1.
- the inference result PR1 is an inference result using the first pseudo-label generation object detection model PDM1 that has been trained using the data set 1 (DS1) whose responsibility range is the bag. Associated.
- the second storage unit 160c stores data set 1' (DS1'), data set 2' (DS2'), and object detection model DM.
- Data set 1' (DS1') and data set 2' (DS2') are data sets generated by the second data set generator 104-2 and the first data set generator 104-1, respectively.
- the object detection model DM is a target image detection model, and the details thereof will be described later.
- the second control unit 110c includes a re-learning unit 105.
- the relearning unit 105 is a re-learning unit.
- the re-learning unit 105 is a configuration that implements pseudo-label reference learning means in this exemplary embodiment.
- the re-learning unit 105 learns the target image detection model using the data set to which the pseudo label has been assigned. Specifically, the re-learning unit 105 re-learns the first pseudo-label generation object detection model PDM1 or the second pseudo-label generation object detection model PDM2 as the learning.
- the relearning unit 105 reads the data set 1′ (DS1′) and the data set 2′ (DS2′) from the second storage unit 160c, and reads the data set 1′ (DS1′) and data
- the set 2′ (DS2′) is used to relearn the first pseudo-label generation object detection model PDM1 or the second pseudo-label generation object detection model PDM2.
- the relearning unit 105 stores the object detection model DM generated by the relearning in the second storage unit 160c.
- the relearning unit 105 may learn a new object detection model DM using data set 1′ (DS1′) and data set 2′ (DS2′).
- the new object detection model DM is a target image detection model that is different from both the first pseudo-label generation object detection model PDM1 and the second pseudo-label generation object detection model PDM2.
- a threshold is determined based on each of a plurality of expert datasets, and a pseudo label is assigned to each of the plurality of datasets based on the threshold. is adopted.
- a plurality of datasets each assigned a pseudo-label specifically dataset 1′ (DS1′) and dataset 2′ ( DS2') can be used to re-learn the detection model, so that the effect of further improving the detection accuracy of the object included in the image using the re-learned detection model can be obtained.
- the information processing apparatus 10c when generating a plurality of data sets each assigned a pseudo label, that is, when a plurality of thresholds for determining pseudo labels are required
- a plurality of thresholds for determining pseudo labels are required
- the plurality of thresholds can be automatically determined, an effect is obtained that the cost for adjusting the thresholds can be reduced.
- the number of expert data sets is "2"
- the number of expert data sets is not limited to this example.
- the number of data sets and evaluation data sets stored in the information processing device 10c, and the number of members realizing the learning means, threshold value determination means, inference means, and data set generation means in the information processing device 10c are expert data. It depends on the number of sets. For example, when the number of expert data sets is "3", the information processing device 10c further stores a third data set and a third evaluation data set, and further includes a third learning unit, a third It further comprises a threshold determiner, a third reasoner and a third data set generator.
- the first data set generation unit 104-1 and the second data set generation unit 104-2 have the function of the associating unit 1042b described in the third exemplary embodiment. good too. That is, the first data set generation unit 104-1 generates data set 2′ (DS2′) in the case where the correct label is assigned to the object included in the image associated with the pseudo label, and the pseudo label If the degree of overlap between the region indicated by the region information included in the label and the region indicated by the region information included in the correct label is greater than or equal to a predetermined degree, the pseudo label may be deleted.
- DS2′ data set 2′
- the second data set generation unit 104-2 generates data set 1′ (DS1)′ in the case where the correct label is assigned to the object included in the image associated with the pseudo label, and the pseudo label If the degree of overlap between the region indicated by the region information included in the label and the region indicated by the region information included in the correct label is greater than or equal to a predetermined degree, the pseudo label may be deleted.
- Some or all of the functions of the information processing apparatuses 10, 10a to 10c, 20 and 20a may be realized by hardware such as integrated circuits (IC chips) or by software.
- the information processing apparatuses 10, 10a to 10c, 20, and 20a are implemented by computers that execute program instructions, which are software that implements each function, for example.
- An example of such a computer (hereinafter referred to as computer C) is shown in FIG.
- Computer C comprises at least one processor C1 and at least one memory C2.
- a program P for operating the computer C as the information processing apparatuses 10, 10a to 10c, 20 and 20a is recorded in the memory C2.
- the processor C1 reads the program P from the memory C2 and executes it, thereby realizing each function of the information processing apparatuses 10, 10a to 10c, 20 and 20a.
- processor C1 for example, CPU (Central Processing Unit), GPU (Graphic Processing Unit), DSP (Digital Signal Processor), MPU (Micro Processing Unit), FPU (Floating point number Processing Unit), PPU (Physics Processing Unit) , a microcontroller, or a combination thereof.
- memory C2 for example, a flash memory, HDD (Hard Disk Drive), SSD (Solid State Drive), or a combination thereof can be used.
- the computer C may further include a RAM (Random Access Memory) for expanding the program P during execution and temporarily storing various data.
- Computer C may further include a communication interface for sending and receiving data to and from other devices.
- Computer C may further include an input/output interface for connecting input/output devices such as a keyboard, mouse, display, and printer.
- the program P can be recorded on a non-temporary tangible recording medium M that is readable by the computer C.
- a recording medium M for example, a tape, disk, card, semiconductor memory, programmable logic circuit, or the like can be used.
- the computer C can acquire the program P via such a recording medium M.
- the program P can be transmitted via a transmission medium.
- a transmission medium for example, a communication network or broadcast waves can be used.
- Computer C can also obtain program P via such a transmission medium.
- (Appendix 1) a learning means for learning a detection model using the first data set; One or more inference results obtained by inputting each of one or more images included in the evaluation data set into the detection model, and one or more correct labels attached to each of the one or more images
- a threshold determination means for determining a threshold with reference to the result of comparison with inference means for obtaining one or more inference results for each of the one or more images included in the second data set by inputting each of the one or more images into the detection model; setting an inference result having a reliability equal to or higher than the threshold among the one or more inference results by the inference means as a pseudo-label, and associating the pseudo-label with the corresponding image to obtain a data set after pseudo-labeling; and data set generation means for generating a data set.
- the inference result of the image included in the evaluation data set by the detection model trained using the first data set is compared with the correct label associated with the image. Based on this, the threshold value for pseudo-label setting is automatically determined. Therefore, according to the configuration of Supplementary Note 1, it is possible to reduce the cost for adjusting the threshold. Then, according to the configuration of Supplementary Note 1, from the inference result of the image included in the second data set by the detection model, the inference result having a confidence level equal to or higher than the automatically determined threshold is set as the pseudo label, Associate the pseudo-label with the corresponding image. Therefore, according to the configuration of Supplementary Note 1, it is possible to reduce the cost of generating a data set including images to which pseudo labels are assigned.
- An information processing apparatus further comprising pseudo-label reference learning means for learning a target image detection model for detecting an object included in the target image, using the data set to which the pseudo label has been assigned.
- Appendix 3 The information processing device according to appendix 2, The information processing apparatus, wherein the pseudo label reference learning means re-learns the detection model as the learning of the target image detection model.
- the detection model is re-learned using the data set after pseudo-labeling. Therefore, according to the configuration of Supplementary Note 3, it is possible to reduce the cost of re-learning the detection model. Also, if an appropriate value can be determined as the threshold, the number of times the threshold is adjusted can be reduced, and the number of re-learning of the detection model required each time the threshold is adjusted can be reduced. As a result, it is possible to reduce the time until re-learning of the detection model is completed.
- the information processing device includes area information indicating the area of the object included in the image associated with the correct label, and category information indicating the category of the object
- the correct label and the pseudo label include area information and category information. Therefore, according to the configuration of Supplementary Note 4, it is possible to improve the accuracy of detecting an object included in an image using a detection model that has been re-learned using a data set after pseudo-labeling. .
- the information processing device according to appendix 4, At least part of the one or more images included in the second data set is labeled with one or more correct labels,
- the data set generation means is When a correct label is assigned to an object included in an image associated with the pseudo label, the area indicated by the area information included in the pseudo label and the area indicated by the area information included in the correct label 2.
- the pseudo label in the pseudo labels and the correct labels attached to the images included in the second data set, the area indicated by the area information included in the pseudo label and the area indicated by the area information included in the correct label If the degree of overlap with is greater than or equal to a predetermined degree, the pseudo label is deleted. For this reason, according to the configuration of Supplementary Note 5, when the pseudo label is not appropriate, the pseudo label is deleted and the correct label remains. Therefore, the accuracy of object detection using the retrained detection model is improved. becomes possible. In particular, in the case of datasets in which the correct labels are attached to visually similar objects, there is a high probability that pseudo-labels with incorrect categories will be associated with the objects. On the other hand, according to the configuration of Supplementary Note 5, since the erroneous pseudo label can be deleted, the pseudo label can be generated with high accuracy, and the object detection accuracy using the target image detection model can be improved. can be improved.
- the pseudo-label is not appropriate, for example, (1) the category of the pseudo-label is different from the category of the object, (2) the bounding box of the pseudo-label does not enclose part of the object, etc. point to
- the information processing device according to appendix 4 or 5
- the threshold determination means sets the threshold for each category
- the data set generation means is Among the one or more inference results by the inference means, an inference result having a reliability level equal to or higher than the threshold value set for each category is set as a pseudo label, and the pseudo label is associated with the corresponding image to generate a pseudo An information processing device that generates a data set after labeling.
- the inference result of the image included in the evaluation data set by the detection model trained using the first data set and the correct label associated with the image for each category A threshold is set for , and an inference result equal to or greater than the threshold is set as a pseudo label. Therefore, according to the configuration of Supplementary Note 6, it is possible to improve the accuracy of setting pseudo labels.
- Appendix 7 The information processing device according to any one of appendices 1 to 6, The information processing apparatus, wherein the threshold determination means determines the threshold by referring to the matching rate and the recall rate indicated by the comparison result.
- the threshold is determined by referring to the precision and recall calculated from . Therefore, according to the configuration of Supplementary Note 7, it is possible to improve the accuracy of setting pseudo labels. In addition, according to the configuration of Supplementary Note 7, it is possible to set pseudo labels in consideration of both the quality (precision rate) of learning data and the amount (recall rate) of learning data, so highly accurate target image detection model can be generated.
- Appendix 8 The information processing device according to any one of Appendices 1 to 7, The information processing apparatus, wherein the evaluation data set is included in the first data set.
- the images included in the evaluation data set are included in the first data set. Therefore, according to the configuration of Supplementary Note 8, there is no need to newly perform a high-cost correct answer work for generating the evaluation data set. Further, according to the configuration of Supplementary Note 8, it is possible to reduce the number of images prepared in advance.
- Appendix 9 The information processing device according to any one of Appendices 1 to 7, The information processing apparatus, wherein the evaluation data set is generated by giving a correct label to a part of the second data set.
- the images included in the evaluation data set are generated by giving correct labels to part of the second data set. Therefore, according to the configuration of Supplementary Note 9, a part of the data set to which the pseudo-label is assigned is used as the evaluation data set to determine the threshold value, so the accuracy of the assigned pseudo-label is improved. It is possible to Further, according to the configuration of Supplementary Note 9, it is possible to reduce the number of images prepared in advance.
- the inference result of the image included in the first evaluation data set by the detection model trained using the first data set, and the correct label associated with the image. automatically determine a first threshold for setting pseudo-labels to the second data set based on the comparison of .
- the inference result of the image included in the second evaluation data set by the detection model trained using the second data set, and the correct answer associated with the image A second threshold for setting the pseudo-labels to the first data set is automatically determined based on the comparison with the labels. Therefore, according to the configuration of Supplementary Note 10, it is possible to reduce the cost for adjusting the first threshold and the second threshold.
- a pseudo label is determined using an automatically determined threshold, and a target image detection model trained using a data set including images associated with the pseudo label is used. to detect objects included in the target image. Therefore, according to the configuration of Supplementary Note 11, it is possible to detect an object included in the target image using the target image detection model in which the cost for adjusting the threshold value is reduced.
- (Appendix 12) a learning step of learning a detection model using the first data set; One or more inference results obtained by inputting each of one or more images included in the evaluation data set into the detection model, and one or more correct labels attached to each of the one or more images
- the target image detection model includes: A learning process for learning a detection model using the first data set; One or more inference results obtained by inputting each of one or more images included in the evaluation data set into the detection model, and one or more correct labels attached to each of the one or more images A threshold determination process for determining a threshold with reference to the result of comparison with Inference processing for obtaining one or more inference results for each of the one or more images by inputting each of the one or more images included in the second data set into the detection model; By setting an inference result having a reliability equal to or higher than the threshold among the one or more inference results by the inference process as a pseudo label and associating the pseudo label with the corresponding image, the data set after pseudo labeling and a pseudo-label reference learning process for learning the target image detection model by referring to the data set after the pseudo-labeling. Processing method.
- (Appendix 14) a learning step of learning a detection model using the first data set; One or more inference results obtained by inputting each of one or more images included in the evaluation data set into the detection model, and one or more correct labels attached to each of the one or more images
- the inference result of the image included in the evaluation data set by the detection model trained using the first data set is compared with the correct label associated with the image. Based on this, the threshold value for pseudo-label setting is automatically determined. Therefore, according to the configuration of Supplementary Note 14, it is possible to reduce the cost for adjusting the threshold. Then, according to the configuration of Supplementary Note 14, learning of the target image detection model is performed using the data set to which the pseudo label has been assigned. Therefore, according to the configuration of Supplementary Note 14, it is possible to manufacture the detection model for the target image while reducing the cost for adjusting the threshold value. As a result, it is possible to reduce the cost of learning the target image detection model.
- the number of times the threshold value is adjusted can be reduced, and the number of times of learning required each time the threshold value is adjusted can be reduced. As a result, it is possible to reduce the time until the learning of the target image detection model is completed.
- a program for causing a computer to function as an information processing device comprising: a learning means for learning a detection model using the first data set; One or more inference results obtained by inputting each of one or more images included in the evaluation data set into the detection model, and one or more correct labels attached to each of the one or more images
- a program for causing a computer to function as an information processing device comprising: acquisition means for acquiring a target image; detection means for detecting an object included in the target image using a target image detection model; function as The target image detection model includes: A learning process for learning a detection model using the first data set; One or more inference results obtained by inputting each of one or more images included in the evaluation data set into the detection model, and one or more correct labels attached to each of the one or more images A threshold determination process for determining a threshold with reference to the result of comparison with Inference processing for obtaining one or more inference results for each of the one or more images by inputting each of the one or more images included in the second data set into the detection model; By setting an inference result having a reliability equal to or higher than the threshold among the one or more inference results by the inference process as a pseudo label and associating the pseudo label with the corresponding image, the data set after pseudo labeling and a pseudo-label reference learning process for learning the target image detection model by
- At least one processor is provided, and the processor inputs each of one or more images included in the evaluation data set into the detection model, and a learning process of learning a detection model using a first data set.
- a threshold determination process for determining a threshold with reference to a comparison result between one or more inference results obtained by the method and one or more correct labels attached to each of the one or more images; inference processing for obtaining one or more inference results for each of the one or more images by inputting each of the one or more images included in the set into the detection model;
- Data set generation for generating a data set after assigning a pseudo label by setting an inference result having a confidence level equal to or higher than the threshold as a pseudo label among a plurality of inference results and associating the pseudo label with the corresponding image.
- An information processing device that executes a process.
- the information processing apparatus may further include a memory, in which the learning process, the threshold value determination process, the inference process, and the data set generation process are executed by the processor.
- a program may be stored for causing the Also, this program may be recorded in a computer-readable non-temporary tangible recording medium.
- At least one processor is provided, and the processor performs an acquisition process of acquiring a target image and a detection process of detecting an object included in the target image using a target image detection model, and the target
- the image detection model includes a learning process for learning the detection model using the first data set, and one or more images obtained by inputting each of one or more images included in the evaluation data set into the detection model.
- a data set generation process for generating a data set after the pseudo-labeling by setting an inference result having a reliability equal to or higher than a threshold as a pseudo-label and associating the pseudo-label with the corresponding image, and after the pseudo-labeling is learned by a pseudo-label reference learning process for learning the target image detection model by referring to the data set of .
- the information processing apparatus may further include a memory, and the memory may store a program for causing the processor to execute the acquisition process and the detection process. Also, this program may be recorded in a computer-readable non-temporary tangible recording medium.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Medical Informatics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Physics & Mathematics (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Artificial Intelligence (AREA)
- Image Analysis (AREA)
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/JP2021/008696 WO2022185531A1 (ja) | 2021-03-05 | 2021-03-05 | 情報処理装置、情報処理方法、検知モデルの製造方法、およびプログラム |
| PCT/JP2022/005877 WO2022185899A1 (ja) | 2021-03-05 | 2022-02-15 | 情報処理装置、情報処理方法、検知モデルの製造方法、およびプログラム |
| JP2023503690A JP7582442B2 (ja) | 2021-03-05 | 2022-02-15 | 情報処理装置、情報処理方法、検知モデルの製造方法、およびプログラム |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/JP2021/008696 WO2022185531A1 (ja) | 2021-03-05 | 2021-03-05 | 情報処理装置、情報処理方法、検知モデルの製造方法、およびプログラム |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2022185531A1 true WO2022185531A1 (ja) | 2022-09-09 |
Family
ID=83154107
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/JP2021/008696 Ceased WO2022185531A1 (ja) | 2021-03-05 | 2021-03-05 | 情報処理装置、情報処理方法、検知モデルの製造方法、およびプログラム |
| PCT/JP2022/005877 Ceased WO2022185899A1 (ja) | 2021-03-05 | 2022-02-15 | 情報処理装置、情報処理方法、検知モデルの製造方法、およびプログラム |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/JP2022/005877 Ceased WO2022185899A1 (ja) | 2021-03-05 | 2022-02-15 | 情報処理装置、情報処理方法、検知モデルの製造方法、およびプログラム |
Country Status (2)
| Country | Link |
|---|---|
| JP (1) | JP7582442B2 (https=) |
| WO (2) | WO2022185531A1 (https=) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116343050B (zh) * | 2023-05-26 | 2023-08-01 | 成都理工大学 | 基于自适应权重的遥感图像噪声标注的目标检测方法 |
| CN120125932A (zh) * | 2025-02-10 | 2025-06-10 | 蚂蚁区块链科技(上海)有限公司 | 一种基于关键遗忘机制的伪造图像检测模型的训练方法 |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2020225923A1 (ja) * | 2019-05-09 | 2020-11-12 | 日本電信電話株式会社 | 分析装置、分析方法及び分析プログラム |
| US20200410388A1 (en) * | 2019-06-25 | 2020-12-31 | International Business Machines Corporation | Model training using a teacher-student learning paradigm |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2020197974A (ja) * | 2019-06-04 | 2020-12-10 | 日本電気通信システム株式会社 | 状況認識装置、状況認識方法、および状況認識プログラム |
| KR102283283B1 (ko) * | 2019-07-26 | 2021-07-28 | 주식회사 수아랩 | 데이터 레이블링 우선순위 결정방법 |
-
2021
- 2021-03-05 WO PCT/JP2021/008696 patent/WO2022185531A1/ja not_active Ceased
-
2022
- 2022-02-15 JP JP2023503690A patent/JP7582442B2/ja active Active
- 2022-02-15 WO PCT/JP2022/005877 patent/WO2022185899A1/ja not_active Ceased
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2020225923A1 (ja) * | 2019-05-09 | 2020-11-12 | 日本電信電話株式会社 | 分析装置、分析方法及び分析プログラム |
| US20200410388A1 (en) * | 2019-06-25 | 2020-12-31 | International Business Machines Corporation | Model training using a teacher-student learning paradigm |
Non-Patent Citations (2)
| Title |
|---|
| "arXiv.org", vol. 25, 1 January 1900, CORNELL UNIVERSITY LIBRARY,, 201 Olin Library Cornell University Ithaca, NY 14853, article MEI KE; ZHU CHUANG; ZOU JIAQI; ZHANG SHANGHANG: "Instance Adaptive Self-training for Unsupervised Domain Adaptation", pages: 415 - 430, XP047593075, DOI: 10.1007/978-3-030-58574-7_25 * |
| RHEE HOCHANG; CHO NAM IK: "Efficient and Robust Pseudo-Labeling for Unsupervised Domain Adaptation", 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), IEEE, 18 November 2019 (2019-11-18), pages 980 - 985, XP033733147, DOI: 10.1109/APSIPAASC47483.2019.9023239 * |
Also Published As
| Publication number | Publication date |
|---|---|
| JPWO2022185899A1 (https=) | 2022-09-09 |
| WO2022185899A1 (ja) | 2022-09-09 |
| JP7582442B2 (ja) | 2024-11-13 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN111523621B (zh) | 图像识别方法、装置、计算机设备和存储介质 | |
| US11551079B2 (en) | Generating labeled training images for use in training a computational neural network for object or action recognition | |
| US11741361B2 (en) | Machine learning-based network model building method and apparatus | |
| US10410111B2 (en) | Automated evaluation of neural networks using trained classifier | |
| CN111063410B (zh) | 一种医学影像文本报告的生成方法及装置 | |
| US20200202210A1 (en) | Systems and methods for training a neural network | |
| US20180174062A1 (en) | Root cause analysis for sequences of datacenter states | |
| CN112561073A (zh) | 使用基于批次的主动学习方案训练机器学习模型 | |
| CN112836753B (zh) | 用于域自适应学习的方法、装置、设备、介质和产品 | |
| CN108257670B (zh) | 医学解释模型的建立方法和装置 | |
| CN112085056A (zh) | 目标检测模型生成方法、装置、设备及存储介质 | |
| US20200320440A1 (en) | System and Method for Use in Training Machine Learning Utilities | |
| WO2014176056A2 (en) | Data classification | |
| WO2022185899A1 (ja) | 情報処理装置、情報処理方法、検知モデルの製造方法、およびプログラム | |
| JP2021009645A (ja) | 属性認識システム、学習サーバ、及び属性認識プログラム | |
| US20220269718A1 (en) | Method And Apparatus For Tracking Object | |
| CN109886343A (zh) | 图像分类方法及装置、设备、存储介质 | |
| US20200250544A1 (en) | Learning method, storage medium, and learning apparatus | |
| CN114503131A (zh) | 检索装置、检索方法、检索程序和学习模型检索系统 | |
| CN115374950A (zh) | 样本检测方法、样本检测装置、电子设备及存储介质 | |
| US20210049513A1 (en) | Unsupervised model evaluation method, apparatus, server, and computer-readable storage medium | |
| US11556760B2 (en) | Learning-based data processing system and model update method | |
| JPWO2016084326A1 (ja) | 情報処理システム、情報処理方法、及び、プログラム | |
| CN111858999A (zh) | 一种基于分段困难样本生成的检索方法及装置 | |
| US12073643B2 (en) | Machine learning apparatus, machine learning method, and computer-readable recording medium |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21929103 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 21929103 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: JP |