WO2019215994A1 - Image determination system, image determination method, and image determination program - Google Patents

Image determination system, image determination method, and image determination program Download PDF

Info

Publication number
WO2019215994A1
WO2019215994A1 PCT/JP2019/006777 JP2019006777W WO2019215994A1 WO 2019215994 A1 WO2019215994 A1 WO 2019215994A1 JP 2019006777 W JP2019006777 W JP 2019006777W WO 2019215994 A1 WO2019215994 A1 WO 2019215994A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
type
threshold
indistinguishable
images
Prior art date
Application number
PCT/JP2019/006777
Other languages
French (fr)
Japanese (ja)
Inventor
徳寿 伊賀
Original Assignee
日本電気株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 日本電気株式会社 filed Critical 日本電気株式会社
Priority to JP2020518152A priority Critical patent/JP6969678B2/en
Publication of WO2019215994A1 publication Critical patent/WO2019215994A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis

Definitions

  • the present invention relates to an image determination system, an image determination method, and an image determination program for determining the type of image.
  • Patent Document 1 describes an automatic discrimination method for image types.
  • the type of image is broadly classified into a monochrome image and a color image, and further, the monochrome image is converted into a monochrome photograph, a word processor / newspaper, and a tie-point photograph. Subdivide each.
  • an image with a known image type is read to create a digital image with a known image type, and an upper limit of saturation of the image, a fractal dimension of brightness and saturation are calculated, and automatic A threshold for determination is determined.
  • the threshold value is set too low, there is a high possibility that a determination error will be induced, and there is a possibility that the work for manual confirmation and the work for correction will increase.
  • the threshold value is set high from the initial stage, there is a high possibility that the image will not be discriminated by any type, and there is a possibility that the work of manually setting the correct answer label may increase.
  • the threshold value is uniquely determined according to the nature of the image type.
  • the assumed personnel may not be able to confirm the content of the images, and the overall processing may be incomplete. .
  • the reference for automatically determining the image can be adjusted in accordance with the situation of personnel that can be handled while suppressing the accuracy of automatically determining the image from becoming too low.
  • An object of the present invention is to provide an image determination system, an image determination method, and an image determination program capable of adjusting a reference for determining an image according to the situation of a person while suppressing a decrease in accuracy of determining an image.
  • An image determination system includes: a determination unit that determines the type of an image for which the likelihood of the type output by the estimator that estimates the likelihood of the type of the input image exceeds a set threshold; A threshold setting unit to be set, and the threshold setting unit can be assigned to determine the number of indistinguishable estimated images that are estimated to be the type that the discriminating unit cannot discriminate, and the type of the indistinguishable estimated image
  • the threshold value is set according to the number of.
  • the image determination method determines the type of an image in which the likelihood of the type output by the estimator that estimates the likelihood of the type of the input image exceeds a set threshold, and cannot determine the type.
  • the threshold value is set according to the number of estimated non-discriminable images that are estimated images and the number of personnel that can be assigned to discriminate the type of the indistinguishable estimated image.
  • An image determination program is a determination process for determining the type of an image in which the likelihood of the type output by the estimator that estimates the likelihood of the type of image input to the computer exceeds a set threshold value,
  • the threshold setting process for setting the threshold is executed, and in the threshold setting process, the number of indistinguishable estimation images that are estimated to be the type that cannot be discriminated by the discrimination process and the type of the indistinguishable estimation image
  • a threshold value is set according to the number of assignable personnel.
  • the present invention it is possible to adjust the reference for discriminating the image according to the situation of the personnel while suppressing the reduction of the accuracy of discriminating the image.
  • FIG. FIG. 1 is a block diagram showing a configuration example of a first embodiment of an image determination system according to the present invention.
  • the image determination system 100 of the present embodiment includes an image input unit 10, an image type estimation unit 20, a discriminator 30, a relearning data storage unit 31, a test data storage unit 32, and a type setting unit 40.
  • the learning device 50, the evaluation unit 60, and the output unit 70 are provided.
  • the output unit 70 outputs various results by the image determination system.
  • the output unit 70 is realized by, for example, a display device or a printer.
  • the image input unit 10 inputs an image used by the image type estimation unit 20 to estimate the type.
  • the image input unit 10 may input an image via a communication network, or may read an image stored in a storage unit (not shown) of another system.
  • the image type estimation unit 20 estimates the likelihood of the type of the input image using a model that estimates the likelihood of each type represented by the image (hereinafter simply referred to as a model or a learned model).
  • the method of expressing the probability of the type is arbitrary, and may be, for example, the probability of the type or the reliability for each type.
  • the type setting method used for image classification is arbitrary.
  • the type may be set for each different type of image, or may be set for the same type of image according to the degree of the object represented by the image.
  • Examples of settings for different types of images include different types of forms (resident card, marriage registration, seal certificate, etc.), maps (road maps, topographic maps, etc.), floor plans (residential, office, etc.) ) Is set.
  • the setting is made according to the degree of deterioration of the same building or the degree of progression of lesions.
  • the mode of the model used by the image type estimation unit 20 is arbitrary, for example, a model generated by deep learning.
  • the image type estimation unit 20 may estimate the probability for each type by applying the input image to the model. For example, when there are four types of types of image A, that is, types A to D, and the type A is most likely, the image type estimation unit 20 sets “type A: 85%” and “type B: 10 for each type. The probability may be estimated as “%”, “Category C: 3%”, or “Category D: 2%”. A method for learning a model used by the image type estimation unit 20 will be described later.
  • the discriminator 30 discriminates the image type based on the probability of the image type estimated by the image type estimation unit 20. Specifically, the discriminator 30 discriminates image types whose probability estimated by the model used by the image type estimation unit 20 exceeds a threshold value.
  • a value may be set for each type of threshold value used for determination, or a common value may be set for the entire type. For example, a threshold that is higher than the thresholds of other types may be set for a type that is easy to estimate.
  • the threshold value used by the discriminator 30 is updated by the evaluation unit 60 described later.
  • a threshold value may be set based on the user's experience or the like or an existing estimated probability.
  • a threshold that satisfies the estimated probability may be set by estimating the probability of the type of image to which the correct data is assigned using the current model. A method for updating the threshold will be described later.
  • the discriminator 30 registers the image whose type has been discriminated (that is, the image whose type has been discriminated) in the test data storage unit 32. Further, the discriminator 30 may transmit an image whose type has been discriminated to subsequent processing. For example, the discriminator 30 may cause the output unit 70 to output an image with the type discriminated together with the type, or may notify the image to a processing unit (not shown) that performs subsequent processing corresponding to the type. .
  • the discriminator 30 registers an image whose type could not be discriminated (hereinafter referred to as an indistinguishable image) in the relearning data storage unit 31 when the type of the image could not be discriminated.
  • the registered image is used in the subsequent type setting unit 40.
  • the discriminator 30 may directly transmit the indistinguishable image to the type setting unit 40 without registering it in the relearning data storage unit 31.
  • the type setting unit 40 assigns a type that is correct answer data (label) to an image whose type cannot be determined by the classifier 30 (that is, an image that cannot be determined). For example, the type setting unit 40 may display an indistinguishable image for the user and manually add correct data. In addition, since the indistinguishable image to which correct data is assigned is used as teacher data in subsequent relearning processing, it is preferable that correct data is assigned with high accuracy. Further, since the image with the correct answer label can be used as test data, the type setting unit 40 stores this image in the test data storage unit 32.
  • the learning device 50 learns a model used in the discriminator 30 (that is, a model for estimating the probability for each type represented by the image) using the learning image data.
  • the learning device 50 may learn a model using, for example, existing image data as learning data.
  • the learning device 50 re-learns the model by using the data with the correct answer label assigned to the indistinguishable image as learning image data.
  • a non-discriminable image can be said to be an image for which the model has not been able to estimate the type accuracy at a high level, or an image for which the type could not be estimated with the accuracy required by the discriminator 30 (that is, the accuracy of the set threshold). It is possible to improve the accuracy with which the re-learned model estimates the type by generating learning image data to which a correct answer label is assigned to such an image and performing relearning based on the learning image data. It becomes possible.
  • the method for the relearning by the learning device 50 is arbitrary. For example, if the type does not increase, the learning device 50 may perform additional learning using the learning image data, or the learning data may be an image used for learning in the past and an indistinguishable image provided with a correct answer label. May be used to learn a new model.
  • the data used by the learning device 50 for re-learning can also be selected according to the learning method.
  • the learning device 50 may perform relearning of the model using, for example, all the images that cannot be discriminated to which the correct answer label is assigned as learning image data. Further, in order to shorten the learning time and enhance the estimation accuracy for a specific type, the learning device 50 uses only a specific type of image as learning image data among non-distinguishable images assigned with correct labels. Also good.
  • the estimation accuracy of a type with a low probability of being estimated to be applicable is improved.
  • the estimated probability of type D is 51%.
  • the learning device 50 may perform relearning using learning image data in which the type (for example, type D) is set as the correct answer label.
  • the learning device 50 may perform relearning using image data of the type that is set most (or most frequently) among the images that cannot be discriminated. This is because such types of images are considered to have low estimation accuracy. By performing relearning using such an image, it is possible to generate a model that can accurately estimate the type of a type with low estimation accuracy.
  • the timing at which the learning device 50 performs relearning is also arbitrary.
  • the learning device 50 may perform relearning after a lapse of a predetermined period, or may perform relearning when a predetermined number of pieces of learning image data have accumulated so that accuracy can be ensured.
  • the learning device 50 may perform relearning when the ratio of images whose type cannot be determined by the determination device 30 (hereinafter referred to as an undistinguishable ratio) does not satisfy a predetermined criterion.
  • the learning device 50 calculates, for example, the ratio of the number of images whose type could not be determined with respect to the total number of images to be determined as an indistinguishable ratio, and re-enters when the indistinguishable ratio exceeds a predetermined reference. You may learn. This is preferable because, for example, when the tendency of the input image changes, it becomes possible to automatically relearn.
  • the evaluation unit 60 evaluates the accuracy of the model learned by the learning device 50. Then, the evaluation unit 60 compares the accuracy of the existing model (the model before relearning) with the model after the relearning, and selects a model with higher accuracy. The evaluation unit 60 may automatically replace the selected model as a model used by the image type estimation unit 20, or may replace it according to an instruction from a user or the like.
  • the evaluation unit 60 may evaluate the accuracy of the model based on a single index, or may evaluate the accuracy of the model by comprehensively judging a plurality of indices. At that time, the evaluation unit 60 may evaluate the accuracy of the model by using both an image whose type is data stored in the test data storage unit 32 and an unidentifiable image. The accuracy of the model may be evaluated using this data. For example, the evaluation unit 60 discriminates an indistinguishable image in parallel between an existing model and a model after re-learning, and the probability of each type is higher in the model after re-learning than in the existing model. In such a case, the model after re-learning may be selected by determining that the accuracy has increased.
  • the evaluation unit 60 may calculate the number (or rate) at which discrimination is possible as a result of discriminating images determined to be indistinguishable with the existing model with the model after relearning. Then, when the calculated number (or ratio) satisfies a predetermined criterion, the evaluation unit 60 may determine that the model after re-learning has become a more accurate model and select the model. .
  • the evaluation unit 60 applies a plurality of test data to both the existing model and the model after relearning, calculates the certainty for each type, and statistically processes the evaluation result for each test data (for example, The average value of the certainty may be calculated, and the standard deviation of the certainty may be calculated). For example, when the average value is calculated as the statistical process, the evaluation unit 60 may select a model having a high probability average value as a more accurate model. For example, when the standard deviation is calculated as the statistical process, the evaluation unit 60 may select a model with a small standard deviation of probability (that is, less evaluation blur) as a more accurate model.
  • the test data used for the statistical processing may be all the past data of the learned type, or may be data of the number selected in advance.
  • the evaluation unit 60 may update the threshold used by the discriminator 30 when updating the model used by the image type estimation unit 20.
  • the degree of changing the threshold value during the update may be determined in advance, and the evaluation unit 60 may update the threshold value according to the degree.
  • the predetermined degree may be for each type or may be common throughout.
  • the evaluation unit 60 may determine the degree to increase the threshold according to the improved accuracy. For example, the change in the past non-discrimination ratio when the threshold value is changed in accordance with the improved accuracy is learned, and the evaluation unit 60 determines from the relationship between the improved accuracy and the improved threshold value and the non-discrimination ratio, The degree to which the threshold value is changed may be determined.
  • the evaluation unit 60 may increase the threshold according to a predetermined allowable discrimination impossible ratio.
  • the re-learning data storage unit 31 stores indistinguishable images.
  • the relearning data storage unit 31 may store the indistinguishable image in association with the correct label given by the type setting unit 40.
  • the test data storage unit 32 stores the type determined or added in association with the image.
  • the relearning data storage unit 31 and the test data storage unit 32 are realized by, for example, a magnetic disk device.
  • the image input unit 10, the image type estimation unit 20, the discriminator 30, the type setting unit 40, the learning unit 50, and the evaluation unit 60 are a CPU (Central Processing) of a computer that operates according to a program (image determination program). Realized by Unit IV).
  • the program is stored in a storage unit (not shown) included in the image determination system, and the CPU reads the program, and in accordance with the program, the image input unit 10, the image type estimation unit 20, the discriminator 30, and the type setting unit. 40, the learning device 50, and the evaluation unit 60 may be operated.
  • the image input unit 10, the image type estimation unit 20, the discriminator 30, the type setting unit 40, the learning device 50, and the evaluation unit 60 may be realized by dedicated hardware, respectively.
  • FIG. 2 is a flowchart illustrating an operation example of the image determination system according to the first embodiment. It is assumed that the model used by the image type estimation unit 20 for estimation in the initial stage is learned in advance.
  • the image input unit 10 inputs an image to be determined (step S11).
  • the image type estimation unit 20 applies a model to the input image, and estimates the likelihood of the type of the input image (step S12).
  • the discriminator 30 discriminates the image type based on the estimated likelihood of the image type (step S13).
  • step S13 If the image type cannot be determined (No in step S13), the type setting unit 40 assigns correct data to the indistinguishable image (step S14) and stores it in the relearning data storage unit 31. Thereafter, the process of step S15 is performed.
  • the discriminator 30 when the type of the image can be discriminated (Yes in step S13), the discriminator 30 notifies the image to a processor (not shown) that performs subsequent processing corresponding to the type (step S15). The discriminator 30 stores the discriminated type in the test data storage unit 32 in association with the image.
  • the learning device 50 re-learns the model by using the data to which the correct label is assigned to the indistinguishable image as the learning image data (step S16).
  • the learning device 50 performs relearning at a timing and method according to a predetermined condition.
  • the evaluation unit 60 evaluates the accuracy of the model learned by the learning device 50 and determines whether the accuracy is higher than that of the existing model (step S17). When the accuracy is high (Yes in step S17), the evaluation unit 60 replaces the existing model used for estimation by the image type estimation unit 20 with a model after re-learning (step S18). Further, the evaluation unit 60 updates the threshold used by the discriminator 30 for discrimination (step S19). On the other hand, if the accuracy is not high (No in step S17), the process is terminated with the existing model as it is.
  • FIG. 3 is an explanatory diagram showing a specific example of the operation of the image determination system of the present embodiment.
  • AI Artificial Intelligence
  • FIG. 3 corresponds to the image type estimation unit 20 and estimates the probabilities of type A, type B, type C, and type D using a model.
  • the discriminator 30 compares the probability of the estimated type with a set threshold value, and identifies a type of probability that is greater than the threshold value.
  • an image for which a type greater than the threshold (80%) is specified is classified as discriminated 30x, and an image for which no type is specified is classified as indistinguishable 30y.
  • the user sets types A to D via the type setting unit 40 for images classified as indistinguishable 30y (that is, indistinguishable images). This data is used as relearning data.
  • the image classified into the discriminated 30x and the indistinguishable image in which the type is set are input to the processors Pa to Pd that perform the respective processes, and each process is performed. Further, these images are stored in the storage device 33 for the discriminated image.
  • the storage device 33 corresponds to the test data storage unit 32.
  • the AI 51 performs relearning of the model using the relearning data, and generates a relearned model.
  • the AI 51 corresponds to the learning device 50 and performs relearning with a timing and method according to a predetermined condition.
  • the relearning model evaluator 61 compares the accuracy of the existing model pm and the model rm after the relearning, and selects a model with higher accuracy. When the model rm after re-learning is selected, the evaluator 61 replaces the model used by the AI 21. In addition, the evaluator 61 changes the threshold used by the discriminator 30.
  • the evaluator 61 corresponds to the evaluation unit 60.
  • the learning device 50 re-learns the model using the indistinguishable image as learning data, and the evaluation unit 60 determines the accuracy of the model before re-learning and the model after re-learning. Compare and select a model with higher accuracy. Therefore, the accuracy of determining an image according to use can be improved. That is, since a model with higher accuracy is selected according to use, it is possible to improve accuracy by determining an image using the model.
  • Embodiment 2 a second embodiment of the image determination system according to the present invention will be described.
  • the image type can be determined accurately within the limited resources. That is, in a situation where a large number of personnel can be secured, it is preferable that the correct label can be assigned by manually determining the type including not only images that are difficult to determine but also images that are delicately determined.
  • the number of images that cannot be discriminated can be reduced by automatically discriminating images that can be discriminated with an acceptable accuracy based on the model.
  • the threshold value is changed according to the model update.
  • a method for adjusting a threshold which is a reference for discriminating an image, according to the situation of personnel that can be secured for assigning a correct answer label to an indistinguishable image.
  • FIG. 4 is a block diagram showing a configuration example of the second embodiment of the image determination system according to the present invention.
  • the image determination system 200 includes an image input unit 10, an image type estimation unit 20, a discriminator 30, a relearning data storage unit 31, a test data storage unit 32, and a type setting unit 40. , A learning device 50, an evaluation unit 60, a threshold setting unit 62, a storage unit 63, and an output unit 70.
  • the image determination system 200 of the present embodiment further includes a threshold setting unit 62 and a storage unit 63 as compared with the image determination system 100 of the first embodiment.
  • Other configurations are the same as those in the first embodiment.
  • the evaluation unit 60 evaluates the accuracy of the model learned by the learning device 50, and determines the accuracy of the existing model (the model before relearning) and the model after the relearning. Compare and select a model with higher accuracy.
  • the threshold setting unit 62 updates the threshold used by the discriminator 30 will be described.
  • the evaluation unit 60 may update the threshold as in the first embodiment.
  • the storage unit 63 stores the correspondence between the ratio of the images for which the classifier 30 cannot determine the type (that is, the ratio that cannot be determined) and the threshold value.
  • each threshold value and the indistinguishable ratio may be associated with each other, and the relationship between the threshold value and the indistinguishable ratio may be expressed by a mathematical expression.
  • the higher the threshold value the higher the non-discriminatory ratio. Therefore, the function representing the relationship between the threshold and the non-discriminable ratio is represented by an arbitrary monotonous non-decreasing function.
  • the storage unit 63 may store the correspondence between the indistinguishable ratio and the threshold for each model used by the image type estimation unit 20.
  • FIG. 5 is an explanatory diagram showing an example in which an indistinguishable ratio is associated with each threshold value.
  • the threshold value and the indistinguishable ratio are associated with each model and type used by the image type estimation unit 20. For example, when the probability of type A is estimated using the model ⁇ , if the threshold is set to 80%, it indicates that the input image is determined to be indistinguishable with an estimated probability of 10%.
  • the storage unit 63 may store a correspondence relationship between the indistinguishable ratio and the threshold according to past experience and the like. Further, the threshold value setting unit 62 described later may calculate the ratio of non-discriminable images from the discrimination result of the actually input image. The operation of the threshold setting unit 62 will be described later.
  • the storage unit 63 is realized by a magnetic disk device, for example.
  • the threshold setting unit 62 sets a threshold used when the discriminator 30 discriminates the type of image. Specifically, the threshold setting unit 62 can be assigned to determine the number of images (hereinafter, referred to as indistinguishable estimated images) estimated that the discriminator 30 cannot determine the type and the type of the indistinguishable estimated image.
  • the threshold is set according to the number of personnel. It can be said that the number of personnel that can be assigned to the classification determination is the number of personnel that perform the identification confirmation work. Specifically, the correct label is set to the indistinguishable estimation image by this person.
  • N is the number of personnel that can be invested in the period T (hours) during which the discrimination work is performed. Further, the number of images that can be discriminated in one unit period by one person is assumed to be W (time / person / sheet).
  • the number of images input during the period T is X (sheets)
  • the number Xn (sheets) of indistinguishable estimated images is expressed by the following formula 1, assuming that the indistinguishable ratio is P. .
  • the threshold setting unit 62 determines a threshold that satisfies the indistinguishable ratio P ⁇ T ⁇ N / W ⁇ Xn. Then, the threshold setting unit 62 notifies the discriminator 30 of the determined threshold, and sets the notified threshold. In the example illustrated in FIG. 5, for example, it is assumed that the classification impossible ratio of type A is calculated as 8%. In this case, the threshold value setting unit 62 determines the threshold value that sets the highest threshold value, 75%, among the threshold values that satisfy the indistinguishable ratio 8%.
  • the threshold setting unit 62 may calculate a non-discrimination ratio for each threshold set from the discrimination result of the actually input image. Specifically, when the threshold setting unit 62 receives an instruction to calculate the indistinguishable ratio from the user, the threshold setting unit 62 inputs an image for threshold calculation to the image input unit 10 and cannot determine the number of input images while changing the threshold. An indistinguishable ratio is calculated based on the number of images, and the calculation result is registered in the storage unit 63. For example, image data stored in the test data storage unit 32 may be used as the threshold calculation image.
  • the threshold setting unit 62 changes the situation for estimating the likelihood of the image type (for example, the model used by the image type estimation unit 20 is updated). Depending on the situation, the indistinguishable ratio for each threshold value may be recalculated. That is, the threshold setting unit 62 determines the non-discrimination ratio for each threshold at the timing when the evaluation unit 60 updates the model, in the same manner as when the evaluation unit 60 updates the model at the timing when the evaluation unit 60 updates the model in the first embodiment. May be recalculated and the information in the storage unit 63 may be updated.
  • the image input unit 10, the image type estimation unit 20, the discriminator 30, the type setting unit 40, the learning device 50, the evaluation unit 60, and the threshold setting unit 62 are according to a program (image determination program). It is realized by a CPU of an operating computer.
  • FIG. 6 is a flowchart illustrating an operation example of the image determination system according to the second embodiment.
  • the threshold value setting unit 62 sets the threshold value of the discriminator 30 according to the number of indistinguishable estimated images and the number of personnel that can be assigned to discriminate the type of the indistinguishable estimated images (step S21).
  • the image type estimation unit 20 estimates the likelihood of the type of the input image (step S22). Then, the discriminator 30 discriminates the image type whose probability of the estimated type exceeds the threshold value based on the set threshold value (step S23).
  • the threshold value setting unit 62 sets the threshold value according to the number of indistinguishable estimation images and the number of personnel that can be assigned to the discrimination of the indistinguishable estimation image type. Then, the discriminator 30 discriminates image types whose probability estimated by the image type estimation unit 20 exceeds the set threshold value.
  • FIG. 7 is a block diagram showing an outline of an image determination system according to the present invention.
  • the image determination system 90 (for example, the image determination system 200) according to the present invention has a likelihood of the type output by the estimator (for example, the image type estimation unit 20) that estimates the likelihood of the type of the input image.
  • a discrimination unit 91 eg, the discriminator 30
  • a threshold setting unit 92 eg, a threshold setting unit 62
  • the threshold setting unit 92 determines the threshold according to the number of indistinguishable estimated images that are estimated to be the type that cannot be discriminated by the discriminating unit 91 and the number of personnel that can be assigned to discriminate the type of the indistinguishable estimated image. Set.
  • the image determination system 90 may include a storage unit (for example, the storage unit 63) that stores the correspondence relationship between the determination impossible ratio, which is the ratio of images for which the determination unit 91 cannot determine the type, and the threshold value. Good. Then, the threshold setting unit 92 calculates the number of indistinguishable estimated images for each threshold from the number of images to be discriminated and the indistinguishable ratio, and assigns the calculated number to the personnel that can be assigned to discriminate the indistinguishable estimated images. A threshold value may be set accordingly.
  • the threshold value setting unit 92 may set the highest threshold value among the threshold values that can be determined by the assignable personnel for the calculated number of indistinguishable estimation images.
  • the threshold setting unit 92 may calculate the indistinguishable ratio while changing the threshold based on the number of indistinguishable images with respect to the number of input images, and may register the calculation result in the storage unit.
  • the threshold value setting unit 92 may calculate an indistinguishable ratio for each threshold value according to a change in the situation for estimating the likelihood of the image type (for example, a change in the model used by the image type estimation unit 20).

Abstract

An image determination system 90 is provided with an identification unit 91 and a threshold value setting unit 92. The identification unit 91 identifies the type of image in which the probability of the type outputted by an estimator for estimating the probability of a type of image to be inputted exceeds a threshold value having been set. The threshold value setting unit 92 sets the threshold value in accordance with the number of unidentifiable estimated images the types of which are estimated to be unidentifiable by the identification unit 91 and the number of personnel to whom identification of types of the unidentifiable estimated images can be assigned.

Description

画像判定システム、画像判定方法および画像判定プログラムImage determination system, image determination method, and image determination program
 本発明は、画像の種別を判定する画像判定システム、画像判定方法および画像判定プログラムに関する。 The present invention relates to an image determination system, an image determination method, and an image determination program for determining the type of image.
 様々な種別の画像に対して対応する後続処理を適切に行うため、画像の内容を人手で確認して、画像を各種別に分類することが一般的に行われている。一方、人手で確認する作業を効率化し、作業コストを低減させる方法も提案されている。 In order to appropriately perform the subsequent processing corresponding to various types of images, it is generally performed to manually check the contents of the images and classify the images into various types. On the other hand, a method for improving the efficiency of manual confirmation and reducing the work cost has been proposed.
 特許文献1には、画像種別の自動判別方法が記載されている。特許文献1に記載された判別方法では、画像の種別をモノクロ画像とカラー画像に大別し、更にモノクロ画像をモノクロ写真、ワープロ・新聞、綱点写真に、カラー画像を原色の絵、カラー写真にそれぞれ細別する。そして、特許文献1に記載された方法では、画像種別が既知の画像を読み込んで画像種別が既知のデジタル画像を作成し、画像の彩度上限、明度と彩度のフラクタル次元を計算し、自動判別のための閾値を決定する。 Patent Document 1 describes an automatic discrimination method for image types. In the discrimination method described in Patent Document 1, the type of image is broadly classified into a monochrome image and a color image, and further, the monochrome image is converted into a monochrome photograph, a word processor / newspaper, and a tie-point photograph. Subdivide each. In the method described in Patent Document 1, an image with a known image type is read to create a digital image with a known image type, and an upper limit of saturation of the image, a fractal dimension of brightness and saturation are calculated, and automatic A threshold for determination is determined.
特開平06-350861号公報JP-A-06-350861
 学習するための画像が少ない場合や、判定が難しいような類似する種別の画像が混在するような場合では、全ての画像に対して精度の高い判定を行うことは困難である。このような状況では、判定器を利用して画像データの種別の確からしさをもとに種別を自動判別したとしても、人手による再確認が必要不可欠である。 When there are few images to learn or when similar types of images that are difficult to determine are mixed, it is difficult to make a highly accurate determination for all images. In such a situation, even if the type is automatically determined based on the certainty of the type of image data using the determiner, manual reconfirmation is indispensable.
 判別不可能な画像の数を減らすためには、画像の種別を自動判別する閾値を低く設定することが考えられる。しかし、閾値を低くし過ぎると、判別ミスを誘発するおそれが高くなるため、人手で確認する作業および修正の作業が増加してしまう可能性がある。一方、初期段階から閾値を高く設定してしまうと、画像がどの種別にも判別されなくなってしまう恐れが高くなるため、人手で正解ラベルを設定する作業が増加する可能性もある。 In order to reduce the number of images that cannot be discriminated, it is conceivable to set a low threshold for automatically discriminating the type of image. However, if the threshold value is set too low, there is a high possibility that a determination error will be induced, and there is a possibility that the work for manual confirmation and the work for correction will increase. On the other hand, if the threshold value is set high from the initial stage, there is a high possibility that the image will not be discriminated by any type, and there is a possibility that the work of manually setting the correct answer label may increase.
 特許文献1に記載された方法では、画像種別の性質に応じて閾値が一意に決定される。しかし、確認すべき画像の数が増加し、判別できない画像の数も増加した場合、想定する人員では、画像の内容を確認しきれなくなり、全体としての処理が不完全になってしまう恐れがある。そのため、画像を自動判別する精度が低くなりすぎることを抑制しつつ、対応可能な人員の状況に応じて、画像を自動判別する基準を調整できることが好ましい。 In the method described in Patent Document 1, the threshold value is uniquely determined according to the nature of the image type. However, if the number of images to be confirmed increases and the number of images that cannot be identified increases, the assumed personnel may not be able to confirm the content of the images, and the overall processing may be incomplete. . For this reason, it is preferable that the reference for automatically determining the image can be adjusted in accordance with the situation of personnel that can be handled while suppressing the accuracy of automatically determining the image from becoming too low.
 そこで、本発明は、画像を判別する精度が低減することを抑制しつつ人員の状況に応じて画像を判別する基準を調整できる画像判定システム、画像判定方法および画像判定プログラムを提供することを目的とする。 SUMMARY OF THE INVENTION An object of the present invention is to provide an image determination system, an image determination method, and an image determination program capable of adjusting a reference for determining an image according to the situation of a person while suppressing a decrease in accuracy of determining an image. And
 本発明による画像判定システムは、入力される画像の種別の確からしさを推定する推定器によって出力される種別の確からしさが、設定された閾値を超える画像の種別を判別する判別部と、閾値を設定する閾値設定部とを備え、閾値設定部が、判別部が種別を判別できないと推定される画像である判別不可推定画像の数と、その判別不可推定画像の種別の判別に割り当て可能な人員の数とに応じて閾値を設定することを特徴とする。 An image determination system according to the present invention includes: a determination unit that determines the type of an image for which the likelihood of the type output by the estimator that estimates the likelihood of the type of the input image exceeds a set threshold; A threshold setting unit to be set, and the threshold setting unit can be assigned to determine the number of indistinguishable estimated images that are estimated to be the type that the discriminating unit cannot discriminate, and the type of the indistinguishable estimated image The threshold value is set according to the number of.
 本発明による画像判定方法は、入力される画像の種別の確からしさを推定する推定器によって出力される種別の確からしさが、設定された閾値を超える画像の種別を判別し、種別を判別できないと推定される画像である判別不可推定画像の数と、その判別不可推定画像の種別の判別に割り当て可能な人員の数とに応じて閾値を設定することを特徴とする。 The image determination method according to the present invention determines the type of an image in which the likelihood of the type output by the estimator that estimates the likelihood of the type of the input image exceeds a set threshold, and cannot determine the type. The threshold value is set according to the number of estimated non-discriminable images that are estimated images and the number of personnel that can be assigned to discriminate the type of the indistinguishable estimated image.
 本発明による画像判定プログラムは、コンピュータに、入力される画像の種別の確からしさを推定する推定器によって出力される種別の確からしさが、設定された閾値を超える画像の種別を判別する判別処理、および、閾値を設定する閾値設定処理を実行させ、閾値設定処理で、判別処理で種別を判別できないと推定される画像である判別不可推定画像の数と、その判別不可推定画像の種別の判別に割り当て可能な人員の数とに応じて閾値を設定させることを特徴とする。 An image determination program according to the present invention is a determination process for determining the type of an image in which the likelihood of the type output by the estimator that estimates the likelihood of the type of image input to the computer exceeds a set threshold value, In addition, the threshold setting process for setting the threshold is executed, and in the threshold setting process, the number of indistinguishable estimation images that are estimated to be the type that cannot be discriminated by the discrimination process and the type of the indistinguishable estimation image A threshold value is set according to the number of assignable personnel.
 本発明によれば、画像を判別する精度が低減することを抑制しつつ人員の状況に応じて画像を判別する基準を調整できる。 According to the present invention, it is possible to adjust the reference for discriminating the image according to the situation of the personnel while suppressing the reduction of the accuracy of discriminating the image.
本発明による画像判定システムの第一の実施形態の構成例を示すブロック図である。It is a block diagram which shows the structural example of 1st embodiment of the image determination system by this invention. 第一の実施形態の画像判定システムの動作例を示すフローチャートである。It is a flowchart which shows the operation example of the image determination system of 1st embodiment. 画像判定システムの動作の具体例を示す説明図である。It is explanatory drawing which shows the specific example of operation | movement of an image determination system. 本発明による画像判定システムの第二の実施形態の構成例を示すブロック図である。It is a block diagram which shows the structural example of 2nd embodiment of the image determination system by this invention. 閾値ごとに判別不可比率を対応付けた例を示す説明図である。It is explanatory drawing which shows the example which matched the non-discriminatory ratio for every threshold value. 第二の実施形態の画像判定システムの動作例を示すフローチャートである。It is a flowchart which shows the operation example of the image determination system of 2nd embodiment. 本発明による画像判定システムの概要を示すブロック図である。It is a block diagram which shows the outline | summary of the image determination system by this invention.
 以下、本発明の実施形態を図面を参照して説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.
実施形態1.
 図1は、本発明による画像判定システムの第一の実施形態の構成例を示すブロック図である。本実施形態の画像判定システム100は、画像入力部10と、画像種別推定部20と、判別器30と、再学習用データ記憶部31と、テスト用データ記憶部32と、種別設定部40と、学習器50と、評価部60と、出力部70とを備えている。
Embodiment 1. FIG.
FIG. 1 is a block diagram showing a configuration example of a first embodiment of an image determination system according to the present invention. The image determination system 100 of the present embodiment includes an image input unit 10, an image type estimation unit 20, a discriminator 30, a relearning data storage unit 31, a test data storage unit 32, and a type setting unit 40. The learning device 50, the evaluation unit 60, and the output unit 70 are provided.
 出力部70は、画像判定システムによる各種結果を出力する。出力部70は、例えば、ディスプレイ装置や、プリンタなどにより実現される。 The output unit 70 outputs various results by the image determination system. The output unit 70 is realized by, for example, a display device or a printer.
 画像入力部10は、画像種別推定部20が種別の推定に用いる画像を入力する。画像入力部10は、通信ネットワークを介して画像を入力してもよいし、他のシステムの記憶部(図示せず)に記憶された画像を読み取ってもよい。 The image input unit 10 inputs an image used by the image type estimation unit 20 to estimate the type. The image input unit 10 may input an image via a communication network, or may read an image stored in a storage unit (not shown) of another system.
 画像種別推定部20は、画像が表す種別ごとの確からしさを推定するモデル(以下、単にモデルまたは学習済モデルと記す。)を用いて、入力された画像の種別の確からしさを推定する。種別の確からしさの表し方は任意であり、例えば、種別らしさの確率であってもよく、種別ごとの信頼度であってもよい。 The image type estimation unit 20 estimates the likelihood of the type of the input image using a model that estimates the likelihood of each type represented by the image (hereinafter simply referred to as a model or a learned model). The method of expressing the probability of the type is arbitrary, and may be, for example, the probability of the type or the reliability for each type.
 画像の分類に用いられる種別の設定方法は任意である。種別は、異なる種類の画像ごとに設定されてもよく、同じ種類の画像について、その画像が表す対象の程度に応じて設定されてもよい。異なる種類の画像ごとに設定される例として、異なる種類の帳票ごと(住民票、婚姻届、印鑑証明など)、地図ごと(道路地図、地形図など)、見取り図ごと(住居用、オフィス用、など)に設定する場合が挙げられる。また、同じ種類の画像ごとに設定される例として、同じ建築物の劣化の度合いや病変の進行度合いに応じて設定する場合が挙げられる。 * The type setting method used for image classification is arbitrary. The type may be set for each different type of image, or may be set for the same type of image according to the degree of the object represented by the image. Examples of settings for different types of images include different types of forms (resident card, marriage registration, seal certificate, etc.), maps (road maps, topographic maps, etc.), floor plans (residential, office, etc.) ) Is set. In addition, as an example of setting for each image of the same type, there is a case where the setting is made according to the degree of deterioration of the same building or the degree of progression of lesions.
 画像種別推定部20が使用するモデルの態様は任意であり、例えば、ディープラーニングにより生成されたモデルである。この場合、画像種別推定部20は、入力された画像をモデルに適用して、種別ごとの確率を推定してもよい。例えば、画像Aの種別の候補が種別A~Dの4種類存在し、種別Aらしさが最も高い場合、画像種別推定部20は、種別ごとに「種別A:85%」、「種別B:10%」、「種別C:3%」、「種別D:2%」のように確率を推定してもよい。なお、画像種別推定部20が使用するモデルを学習する方法については後述される。 The mode of the model used by the image type estimation unit 20 is arbitrary, for example, a model generated by deep learning. In this case, the image type estimation unit 20 may estimate the probability for each type by applying the input image to the model. For example, when there are four types of types of image A, that is, types A to D, and the type A is most likely, the image type estimation unit 20 sets “type A: 85%” and “type B: 10 for each type. The probability may be estimated as “%”, “Category C: 3%”, or “Category D: 2%”. A method for learning a model used by the image type estimation unit 20 will be described later.
 判別器30は、画像種別推定部20によって推定された画像の種別の確からしさに基づいて、画像の種別を判別する。具体的には、判別器30は、画像種別推定部20が推定に使用したモデルによって推定される確からしさが閾値を超える画像の種別を判別する。 The discriminator 30 discriminates the image type based on the probability of the image type estimated by the image type estimation unit 20. Specifically, the discriminator 30 discriminates image types whose probability estimated by the model used by the image type estimation unit 20 exceeds a threshold value.
 判別に用いられる閾値には、種別ごとに値が設定されていてもよく、種別全体で共通の値が設定されていてもよい。例えば、推定が容易な種別には、他の種別の閾値よりも高い閾値が設定されてもよい。 A value may be set for each type of threshold value used for determination, or a common value may be set for the entire type. For example, a threshold that is higher than the thresholds of other types may be set for a type that is easy to estimate.
 また、例えば、複数の種別が設定されていても、ある種別(例えば、種別Xとする。)とそれ以外の種別との判断が出来ればよい場合や、非常に高い確率で判別可能な種別(例えば、種別Yとする。)が存在する場合も考えられる。このような場合、その種別(種別X,種別Y)のみ判別器30に判別させればよいような場合、その種別に対して適切な閾値を設定し、それ以外の種別に対しては、非常に高い閾値を設定してもよい。このようにすることで、人手で判別させることが好ましい種別の画像を抽出することが可能になる。 Further, for example, even when a plurality of types are set, it is only necessary to be able to determine a certain type (for example, type X) and other types, or a type that can be discriminated with a very high probability ( For example, there may be a case where type Y is present. In such a case, when only the type (type X, type Y) needs to be discriminated by the discriminator 30, an appropriate threshold value is set for the type, and an emergency threshold is set for other types. A high threshold may be set. In this way, it is possible to extract a type of image that is preferably discriminated manually.
 第一の実施形態において、判別器30が用いる閾値は、後述する評価部60によって更新される。初期段階では、ユーザの経験等や既存の推定確率に基づいて閾値が設定されてもよい。また、現在のモデルを用いて正解データが付与された画像の種別の確率を推定し、推定される確率を満たすような閾値が設定されてもよい。なお、閾値を更新する方法は後述される。 In the first embodiment, the threshold value used by the discriminator 30 is updated by the evaluation unit 60 described later. In the initial stage, a threshold value may be set based on the user's experience or the like or an existing estimated probability. In addition, a threshold that satisfies the estimated probability may be set by estimating the probability of the type of image to which the correct data is assigned using the current model. A method for updating the threshold will be described later.
 判別器30は、種別を判別した画像(すなわち、種別を判別できた画像)をテスト用データ記憶部32に登録する。また、判別器30は、種別を判別した画像を後続の処理に送信してもよい。判別器30は、例えば、種別を判別した画像を、出力部70に種別とともに出力させてもよいし、種別に対応する後続処理を行う処理器(図示せず)に画像を通知してもよい。 The discriminator 30 registers the image whose type has been discriminated (that is, the image whose type has been discriminated) in the test data storage unit 32. Further, the discriminator 30 may transmit an image whose type has been discriminated to subsequent processing. For example, the discriminator 30 may cause the output unit 70 to output an image with the type discriminated together with the type, or may notify the image to a processing unit (not shown) that performs subsequent processing corresponding to the type. .
 一方、判別器30は、画像の種別が判別できなかった場合、種別が判別できなかった画像(以下、判別不可画像と記す。)を再学習用データ記憶部31に登録する。登録された画像は、後続の種別設定部40で利用される。なお、判別器30は、判別不可画像を再学習用データ記憶部31に登録せず、種別設定部40に直接送信してもよい。 On the other hand, the discriminator 30 registers an image whose type could not be discriminated (hereinafter referred to as an indistinguishable image) in the relearning data storage unit 31 when the type of the image could not be discriminated. The registered image is used in the subsequent type setting unit 40. Note that the discriminator 30 may directly transmit the indistinguishable image to the type setting unit 40 without registering it in the relearning data storage unit 31.
 種別設定部40は、判別器30によって種別が判別できなかった画像(すなわち、判別不可画像)に対して正解データ(ラベル)である種別を付与する。種別設定部40は、例えば、ユーザに対して判別不可画像を表示し、人手で正解データを付与してもよい。なお、正解データが付与された判別不可画像は、後続の再学習処理で教師データとして用いられるため、精度高く正解データが付与されることが好ましい。また、正解ラベルが付与された画像はテストデータとして用いることが可能であるため、種別設定部40は、この画像をテスト用データ記憶部32に記憶する。 The type setting unit 40 assigns a type that is correct answer data (label) to an image whose type cannot be determined by the classifier 30 (that is, an image that cannot be determined). For example, the type setting unit 40 may display an indistinguishable image for the user and manually add correct data. In addition, since the indistinguishable image to which correct data is assigned is used as teacher data in subsequent relearning processing, it is preferable that correct data is assigned with high accuracy. Further, since the image with the correct answer label can be used as test data, the type setting unit 40 stores this image in the test data storage unit 32.
 学習器50は、判別器30で用いられるモデル(すなわち、画像が表す種別ごとの確からしさを推定するモデル)を学習用の画像データを用いて学習する。初期段階では、学習器50は、例えば、既存の画像データを学習データとして用いてモデルを学習すればよい。 The learning device 50 learns a model used in the discriminator 30 (that is, a model for estimating the probability for each type represented by the image) using the learning image data. In the initial stage, the learning device 50 may learn a model using, for example, existing image data as learning data.
 さらに、学習器50は、判別不可画像に対して正解ラベルが付与されたデータを学習用画像データとして用いて、モデルを再学習する。判別不可画像は、モデルが種別の精度を高く推定できなかった画像、または、判別器30が要求する精度(すなわち、設定された閾値の精度)で種別を推定できなかった画像であると言える。このような画像に対して正解ラベルを付与した学習用画像データを生成し、この学習用画像データに基づいて再学習を行うことにより、再学習済モデルが種別を推定する精度を向上させることが可能になる。 Furthermore, the learning device 50 re-learns the model by using the data with the correct answer label assigned to the indistinguishable image as learning image data. A non-discriminable image can be said to be an image for which the model has not been able to estimate the type accuracy at a high level, or an image for which the type could not be estimated with the accuracy required by the discriminator 30 (that is, the accuracy of the set threshold). It is possible to improve the accuracy with which the re-learned model estimates the type by generating learning image data to which a correct answer label is assigned to such an image and performing relearning based on the learning image data. It becomes possible.
 学習器50が再学習を行う方法は任意である。例えば、種別が増加しない場合、学習器50は、学習用画像データを用いて追加学習を行ってもよいし、過去に学習に用いた画像と正解ラベルが付与された判別不可画像とを学習データとして用いて、新たにモデルの学習を行ってもよい。 The method for the relearning by the learning device 50 is arbitrary. For example, if the type does not increase, the learning device 50 may perform additional learning using the learning image data, or the learning data may be an image used for learning in the past and an indistinguishable image provided with a correct answer label. May be used to learn a new model.
 その際、学習器50が再学習に用いるデータも、学習方法に応じて選択することが可能である。学習器50は、例えば、正解ラベルが付与された判別不可画像の全てを学習用画像データとして用いて、モデルの再学習を行ってもよい。また、学習時間の短縮や、特定の種別に対する推定精度の強化のため、学習器50は、正解ラベルが付与された判別不可画像のうち、特定の種別の画像のみを学習用画像データとして用いてもよい。 At that time, the data used by the learning device 50 for re-learning can also be selected according to the learning method. The learning device 50 may perform relearning of the model using, for example, all the images that cannot be discriminated to which the correct answer label is assigned as learning image data. Further, in order to shorten the learning time and enhance the estimation accuracy for a specific type, the learning device 50 uses only a specific type of image as learning image data among non-distinguishable images assigned with correct labels. Also good.
 例えば、該当すると推定される場合の確率が低い種別(すなわち、他の種別よりも確率が高く、その種別に該当すると推定される場合であっても、その確率が低い種別)の推定精度を向上させたいとする。例えば、種別A~Dが存在する場合に、種別Dの推定確率が51%だったとする。この場合、他の種別と比較すると最も高く推定される種別ではあるが、推定確率自体は低いと考えられる。この場合、学習器50は、その種別(例えば、種別D)が正解ラベルとして設定された学習用画像データを用いて、再学習を行ってもよい。 For example, the estimation accuracy of a type with a low probability of being estimated to be applicable (that is, a type with a higher probability than other types and a low probability even if it is assumed to be applicable to that type) is improved. I want to let you. For example, when there are types A to D, the estimated probability of type D is 51%. In this case, it is considered that the estimation probability itself is low although it is the type that is estimated to be highest when compared with other types. In this case, the learning device 50 may perform relearning using learning image data in which the type (for example, type D) is set as the correct answer label.
 また、学習器50は、判別不可画像のうち、最も多く設定された(または、最も割合が多い)種別の画像データを用いて再学習を行ってもよい。このような種別の画像は、推定精度が低いと考えられるからである。このような画像を用いて再学習を行うことで、推定精度の低い種別について、その種別を精度よく推定することが可能なモデルを生成できる。 In addition, the learning device 50 may perform relearning using image data of the type that is set most (or most frequently) among the images that cannot be discriminated. This is because such types of images are considered to have low estimation accuracy. By performing relearning using such an image, it is possible to generate a model that can accurately estimate the type of a type with low estimation accuracy.
 一方、学習器50が再学習を行うタイミングも任意である。学習器50は、所定期間経過後に再学習を行ってもよく、精度を確保できそうな数として予め定めた数の学習用画像データが溜まった場合に再学習を行ってもよい。また、学習器50は、判別器30によって種別が判別できない画像の割合(以下、判別不可比率と記す。)が所定の基準を満たさなくなった場合に、再学習を行ってもよい。学習器50は、例えば、判別対象の画像の全体数に対して種別が判別できなかった画像の数の割合を判別不可比率として算出し、判別不可比率が予め定めた基準を超えた場合に再学習を行ってもよい。このようにすることで、例えば、入力される画像の傾向が変わった場合、自動で再学習できるようになるため好ましい。 On the other hand, the timing at which the learning device 50 performs relearning is also arbitrary. The learning device 50 may perform relearning after a lapse of a predetermined period, or may perform relearning when a predetermined number of pieces of learning image data have accumulated so that accuracy can be ensured. In addition, the learning device 50 may perform relearning when the ratio of images whose type cannot be determined by the determination device 30 (hereinafter referred to as an undistinguishable ratio) does not satisfy a predetermined criterion. The learning device 50 calculates, for example, the ratio of the number of images whose type could not be determined with respect to the total number of images to be determined as an indistinguishable ratio, and re-enters when the indistinguishable ratio exceeds a predetermined reference. You may learn. This is preferable because, for example, when the tendency of the input image changes, it becomes possible to automatically relearn.
 評価部60は、学習器50によって学習されたモデルの精度を評価する。そして、評価部60は、既存のモデル(再学習前のモデル)と再学習後のモデルの精度を比較して、より精度の高いモデルを選択する。評価部60は、選択されたモデルを画像種別推定部20が使用するモデルとして自動的に置き換えてもよく、ユーザ等の指示に応じて置き換えてもよい。 The evaluation unit 60 evaluates the accuracy of the model learned by the learning device 50. Then, the evaluation unit 60 compares the accuracy of the existing model (the model before relearning) with the model after the relearning, and selects a model with higher accuracy. The evaluation unit 60 may automatically replace the selected model as a model used by the image type estimation unit 20, or may replace it according to an instruction from a user or the like.
 評価部60は、単一の指標に基づいてモデルの精度を評価してもよく、複数の指標を総合的に判断してモデルの精度を評価してもよい。その際、評価部60は、テスト用データ記憶部32に記憶されたデータである種別が判別できた画像および判別不可画像の両方を利用してモデルの精度を評価してもよく、いずれか一方のデータを利用してモデルの精度を評価してもよい。評価部60は、例えば、既存のモデルと再学習後のモデルとを並行して判別不可画像を判別させ、各種別の確からしさが、既存のモデルよりも再学習後のモデルの方が高くなった場合に、精度が高くなったと判断して再学習後のモデルを選択してもよい。 The evaluation unit 60 may evaluate the accuracy of the model based on a single index, or may evaluate the accuracy of the model by comprehensively judging a plurality of indices. At that time, the evaluation unit 60 may evaluate the accuracy of the model by using both an image whose type is data stored in the test data storage unit 32 and an unidentifiable image. The accuracy of the model may be evaluated using this data. For example, the evaluation unit 60 discriminates an indistinguishable image in parallel between an existing model and a model after re-learning, and the probability of each type is higher in the model after re-learning than in the existing model. In such a case, the model after re-learning may be selected by determining that the accuracy has increased.
 また、評価部60は、既存のモデルで判別不可と判断された画像を再学習後のモデルで判別した結果、判別が可能になった数(または割合)を算出してもよい。そして、評価部60は、算出した数(または割合)が所定の基準を満たした場合に、再学習後のモデルがより精度の高いモデルになったと判断して、そのモデルを選択してもよい。 Further, the evaluation unit 60 may calculate the number (or rate) at which discrimination is possible as a result of discriminating images determined to be indistinguishable with the existing model with the model after relearning. Then, when the calculated number (or ratio) satisfies a predetermined criterion, the evaluation unit 60 may determine that the model after re-learning has become a more accurate model and select the model. .
 他にも、評価部60は、複数のテストデータを既存のモデルと再学習後のモデルの双方に適用して、種別ごとの確からしさを算出し、テストデータごとの評価結果を統計処理(例えば、確からしさの平均値を算出、確からしさの標準偏差を算出)してもよい。例えば、統計処理として平均値が算出された場合、評価部60は、確からしさの平均値が高いモデルをより精度の高いモデルとして選択してもよい。また、例えば、統計処理として標準偏差が算出された場合、評価部60は、確からしさの標準偏差が小さい(すなわち、評価にブレが少ない)モデルをより精度の高いモデルとして選択してもよい。なお、統計処理に用いるテストデータは、学習させた種別の過去のデータ全てであってもよく、予め選択した件数のデータであってもよい。 In addition, the evaluation unit 60 applies a plurality of test data to both the existing model and the model after relearning, calculates the certainty for each type, and statistically processes the evaluation result for each test data (for example, The average value of the certainty may be calculated, and the standard deviation of the certainty may be calculated). For example, when the average value is calculated as the statistical process, the evaluation unit 60 may select a model having a high probability average value as a more accurate model. For example, when the standard deviation is calculated as the statistical process, the evaluation unit 60 may select a model with a small standard deviation of probability (that is, less evaluation blur) as a more accurate model. Note that the test data used for the statistical processing may be all the past data of the learned type, or may be data of the number selected in advance.
 さらに、評価部60は、画像種別推定部20が使用するモデルを更新する際に、判別器30が用いる閾値を併せて更新してもよい。例えば、更新の際に閾値を変更する度合いを予め定めておき、評価部60は、その度合いに応じて閾値を更新してもよい。このとき、定めておく度合いは、種別ごとであってもよく、全体で共通であってもよい。 Furthermore, the evaluation unit 60 may update the threshold used by the discriminator 30 when updating the model used by the image type estimation unit 20. For example, the degree of changing the threshold value during the update may be determined in advance, and the evaluation unit 60 may update the threshold value according to the degree. At this time, the predetermined degree may be for each type or may be common throughout.
 他にも、評価部60は、既存のモデルと再学習後のモデルとを比較した結果、向上した精度に応じて閾値を増加させる度合いを決定してもよい。例えば、向上した精度に応じて閾値を変化させた場合の過去の判別不可比率の変化を学習しておき、評価部60は、向上した精度と向上させた閾値と判別不可比率との関係から、閾値を変化させる度合いを決定してもよい。 Besides, as a result of comparing the existing model with the model after re-learning, the evaluation unit 60 may determine the degree to increase the threshold according to the improved accuracy. For example, the change in the past non-discrimination ratio when the threshold value is changed in accordance with the improved accuracy is learned, and the evaluation unit 60 determines from the relationship between the improved accuracy and the improved threshold value and the non-discrimination ratio, The degree to which the threshold value is changed may be determined.
 具体的には、精度の高いモデルに更新することにより画像種別推定部20による推定確率が向上すれば判別不可比率は減少する。一方、判別器30が用いる判別の閾値を増加させれば判別不可比率が増加することになる。そのため、評価部60は、予め定めた許容できる判別不可比率に応じて、閾値を増加させてもよい。 More specifically, if the estimation probability by the image type estimation unit 20 is improved by updating to a highly accurate model, the indistinguishable ratio decreases. On the other hand, if the discrimination threshold used by the discriminator 30 is increased, the discrimination impossible ratio increases. Therefore, the evaluation unit 60 may increase the threshold according to a predetermined allowable discrimination impossible ratio.
 再学習用データ記憶部31は、判別不可画像を記憶する。なお、再学習用データ記憶部31は、判別不可画像を種別設定部40によって付与された正解ラベルに対応付けて記憶してもよい。また、テスト用データ記憶部32は、判別または付加された種別を画像と対応付けて記憶する。再学習用データ記憶部31およびテスト用データ記憶部32は、例えば、磁気ディスク装置により実現される。 The re-learning data storage unit 31 stores indistinguishable images. The relearning data storage unit 31 may store the indistinguishable image in association with the correct label given by the type setting unit 40. Further, the test data storage unit 32 stores the type determined or added in association with the image. The relearning data storage unit 31 and the test data storage unit 32 are realized by, for example, a magnetic disk device.
 画像入力部10と、画像種別推定部20と、判別器30と、種別設定部40と、学習器50と、評価部60とは、プログラム(画像判定プログラム)に従って動作するコンピュータのCPU(Central Processing Unit )によって実現される。例えば、プログラムは、画像判定システムが備える記憶部(図示せず)に記憶され、CPUは、そのプログラムを読み込み、プログラムに従って、画像入力部10、画像種別推定部20、判別器30、種別設定部40、学習器50および評価部60として動作してもよい。 The image input unit 10, the image type estimation unit 20, the discriminator 30, the type setting unit 40, the learning unit 50, and the evaluation unit 60 are a CPU (Central Processing) of a computer that operates according to a program (image determination program). Realized by Unit IV). For example, the program is stored in a storage unit (not shown) included in the image determination system, and the CPU reads the program, and in accordance with the program, the image input unit 10, the image type estimation unit 20, the discriminator 30, and the type setting unit. 40, the learning device 50, and the evaluation unit 60 may be operated.
 画像入力部10と、画像種別推定部20と、判別器30と、種別設定部40と、学習器50と、評価部60とは、それぞれが専用のハードウェアで実現されていてもよい。 The image input unit 10, the image type estimation unit 20, the discriminator 30, the type setting unit 40, the learning device 50, and the evaluation unit 60 may be realized by dedicated hardware, respectively.
 次に、本実施形態の画像判定システムの動作を説明する。図2は、第一の実施形態の画像判定システムの動作例を示すフローチャートである。なお、画像種別推定部20が初期段階で推定に用いるモデルは予め学習されているとする。 Next, the operation of the image determination system of this embodiment will be described. FIG. 2 is a flowchart illustrating an operation example of the image determination system according to the first embodiment. It is assumed that the model used by the image type estimation unit 20 for estimation in the initial stage is learned in advance.
 画像入力部10は、判別の対象とする画像を入力する(ステップS11)。画像種別推定部20は、入力された画像に対してモデルを適用し、入力された画像の種別の確からしさを推定する(ステップS12)。判別器30は、推定された画像の種別の確からしさに基づいて、画像の種別を判別する(ステップS13)。 The image input unit 10 inputs an image to be determined (step S11). The image type estimation unit 20 applies a model to the input image, and estimates the likelihood of the type of the input image (step S12). The discriminator 30 discriminates the image type based on the estimated likelihood of the image type (step S13).
 画像の種別が判別できなかった場合(ステップS13におけるNo)、種別設定部40は、判別不可画像に対して正解データを付与し(ステップS14)、再学習用データ記憶部31に記憶する。その後ステップS15の処理が行われる。一方、画像の種別が判別できた場合(ステップS13におけるYes)、判別器30は、種別に対応する後続処理を行う処理器(図示せず)に画像を通知する(ステップS15)。また、判別器30は、判別された種別を画像と対応付けてテスト用データ記憶部32に記憶する。 If the image type cannot be determined (No in step S13), the type setting unit 40 assigns correct data to the indistinguishable image (step S14) and stores it in the relearning data storage unit 31. Thereafter, the process of step S15 is performed. On the other hand, when the type of the image can be discriminated (Yes in step S13), the discriminator 30 notifies the image to a processor (not shown) that performs subsequent processing corresponding to the type (step S15). The discriminator 30 stores the discriminated type in the test data storage unit 32 in association with the image.
 学習器50は、判別不可画像に対して正解ラベルが付与されたデータを学習用画像データとして用いて、モデルを再学習する(ステップS16)。学習器50は、予め定められた条件に応じたタイミングおよび方法で再学習を行う。評価部60は、学習器50によって学習されたモデルの精度を評価し、既存のモデルよりも精度が高くなっているか否か判断する(ステップS17)。精度が高くなっている場合(ステップS17におけるYes)、評価部60は、画像種別推定部20が推定に用いている既存のモデルを再学習後のモデルに置き換える(ステップS18)。さらに評価部60は、判別器30が判別に用いる閾値を更新する(ステップS19)。一方、精度が高くなっていない場合(ステップS17におけるNo)、既存のモデルはそのままに処理を終了する。 The learning device 50 re-learns the model by using the data to which the correct label is assigned to the indistinguishable image as the learning image data (step S16). The learning device 50 performs relearning at a timing and method according to a predetermined condition. The evaluation unit 60 evaluates the accuracy of the model learned by the learning device 50 and determines whether the accuracy is higher than that of the existing model (step S17). When the accuracy is high (Yes in step S17), the evaluation unit 60 replaces the existing model used for estimation by the image type estimation unit 20 with a model after re-learning (step S18). Further, the evaluation unit 60 updates the threshold used by the discriminator 30 for discrimination (step S19). On the other hand, if the accuracy is not high (No in step S17), the process is terminated with the existing model as it is.
 次に、本実施形態の画像判定システムの具体例を説明する。図3は、本実施形態の画像判定システムの動作の具体例を示す説明図である。図3に示す例では、画像記憶器11に判別対象の画像が記憶されているとする。図3に例示するAI(Artificial Intelligence )21は、画像種別推定部20に対応し、モデルを使用して種別A、種別B、種別Cおよび種別Dの確からしさを推定する。 Next, a specific example of the image determination system of this embodiment will be described. FIG. 3 is an explanatory diagram showing a specific example of the operation of the image determination system of the present embodiment. In the example illustrated in FIG. 3, it is assumed that an image to be determined is stored in the image storage device 11. AI (Artificial Intelligence) 21 illustrated in FIG. 3 corresponds to the image type estimation unit 20 and estimates the probabilities of type A, type B, type C, and type D using a model.
 次に、判別器30が、推定された種別の確からしさを設定された閾値と比較し、閾値よりも大きい確からしさの種別を特定する。ここで、閾値(80%)よりも大きい種別が特定された画像は、判別済30xに分類され、種別が特定されなかった画像は、判別不可30yに分類される。判別不可30yに分類された画像(すなわち、判別不可画像)に対し、ユーザが種別設定部40を介して、種別A~Dを設定する。このデータは、再学習用データとして利用される。 Next, the discriminator 30 compares the probability of the estimated type with a set threshold value, and identifies a type of probability that is greater than the threshold value. Here, an image for which a type greater than the threshold (80%) is specified is classified as discriminated 30x, and an image for which no type is specified is classified as indistinguishable 30y. The user sets types A to D via the type setting unit 40 for images classified as indistinguishable 30y (that is, indistinguishable images). This data is used as relearning data.
 判別済30xに分類された画像および種別が設定された判別不可画像は、それぞれの処理を行う処理器Pa~Pdに入力され、各処理が行われる。また、これらの画像は、判別済画像の記憶装置33に記憶される。この記憶装置33は、テスト用データ記憶部32に対応する。 The image classified into the discriminated 30x and the indistinguishable image in which the type is set are input to the processors Pa to Pd that perform the respective processes, and each process is performed. Further, these images are stored in the storage device 33 for the discriminated image. The storage device 33 corresponds to the test data storage unit 32.
 その後、AI51は、再学習用データを用いてモデルの再学習を行い、再学習済モデルを生成する。なお、AI51は、学習器50に対応し、予め定められた条件に応じたタイミングおよび方法で再学習を行う。そして、再学習モデルの評価器61は、既存のモデルpmと再学習後のモデルrmの精度を比較して、より精度の高いモデルを選択する。再学習後のモデルrmが選択された場合、評価器61は、AI21が使用するモデルを置き換える。併せて、評価器61は、判別器30が使用する閾値を変更する。なお、評価器61は、評価部60に対応する。 Thereafter, the AI 51 performs relearning of the model using the relearning data, and generates a relearned model. The AI 51 corresponds to the learning device 50 and performs relearning with a timing and method according to a predetermined condition. The relearning model evaluator 61 compares the accuracy of the existing model pm and the model rm after the relearning, and selects a model with higher accuracy. When the model rm after re-learning is selected, the evaluator 61 replaces the model used by the AI 21. In addition, the evaluator 61 changes the threshold used by the discriminator 30. The evaluator 61 corresponds to the evaluation unit 60.
 以上のように、本実施形態では、学習器50が、判別不可画像を学習用データとして用いてモデルを再学習し、評価部60が、再学習前のモデルと再学習後のモデルの精度を比較して、より精度の高いモデルを選択する。よって、使用に応じて画像を判定する精度を向上させることができる。すなわち、使用に応じてより精度の高いモデルが選択されるため、そのモデルを用いて画像を判定することにより、精度を向上させることが可能になる。 As described above, in this embodiment, the learning device 50 re-learns the model using the indistinguishable image as learning data, and the evaluation unit 60 determines the accuracy of the model before re-learning and the model after re-learning. Compare and select a model with higher accuracy. Therefore, the accuracy of determining an image according to use can be improved. That is, since a model with higher accuracy is selected according to use, it is possible to improve accuracy by determining an image using the model.
実施形態2.
 次に、本発明による画像判定システムの第二の実施形態を説明する。画像の種別の判別作業に投入できる人的工数や人件費などには限度があり、限られたリソース内で精度よく判別できることが望まれている。すなわち、人員を多数確保できる状況であれば、判別が困難な画像だけでなく、判別が微妙な画像も含めて人手で種別を判別して正解ラベルを付与できることが好ましい。
Embodiment 2. FIG.
Next, a second embodiment of the image determination system according to the present invention will be described. There is a limit to the man-hours and labor costs that can be put into the image type determination work, and it is desired that the image type can be determined accurately within the limited resources. That is, in a situation where a large number of personnel can be secured, it is preferable that the correct label can be assigned by manually determining the type including not only images that are difficult to determine but also images that are delicately determined.
 一方、人員があまり確保できない状況の場合、判別不可画像が多数発生してしまうと、判別不可画像に正解ラベルを付与する作業が完了できない状況になってしまう。このような状況を避けるために、許容できる精度で判別可能な画像についてはモデルに基づいて自動判別することで判別不可画像の数を低減できることが好ましい。 On the other hand, in a situation where the number of personnel cannot be secured, if a large number of indistinguishable images occur, the task of assigning a correct label to the indistinguishable image cannot be completed. In order to avoid such a situation, it is preferable that the number of images that cannot be discriminated can be reduced by automatically discriminating images that can be discriminated with an acceptable accuracy based on the model.
 第一の実施形態では、モデルの更新に応じて閾値を変化させる場合について説明した。本実施形態では、判別不可画像に対して正解ラベルを付与するために確保可能な人員の状況に応じて、画像を判別する基準である閾値を調整する方法を説明する。 In the first embodiment, the case where the threshold value is changed according to the model update has been described. In the present embodiment, a method for adjusting a threshold, which is a reference for discriminating an image, according to the situation of personnel that can be secured for assigning a correct answer label to an indistinguishable image.
 図4は、本発明による画像判定システムの第二の実施形態の構成例を示すブロック図である。本実施形態の画像判定システム200は、画像入力部10と、画像種別推定部20と、判別器30と、再学習用データ記憶部31と、テスト用データ記憶部32と、種別設定部40と、学習器50と、評価部60と、閾値設定部62と、記憶部63と、出力部70とを備えている。 FIG. 4 is a block diagram showing a configuration example of the second embodiment of the image determination system according to the present invention. The image determination system 200 according to the present embodiment includes an image input unit 10, an image type estimation unit 20, a discriminator 30, a relearning data storage unit 31, a test data storage unit 32, and a type setting unit 40. , A learning device 50, an evaluation unit 60, a threshold setting unit 62, a storage unit 63, and an output unit 70.
 すなわち、本実施形態の画像判定システム200は、第一の実施形態の画像判定システム100と比較し、閾値設定部62および記憶部63をさらに備えている。それ以外の構成は、第一の実施形態と同様である。 That is, the image determination system 200 of the present embodiment further includes a threshold setting unit 62 and a storage unit 63 as compared with the image determination system 100 of the first embodiment. Other configurations are the same as those in the first embodiment.
 評価部60は、第一の実施形態の評価部60と同様、学習器50によって学習されたモデルの精度を評価し、既存のモデル(再学習前のモデル)と再学習後のモデルの精度を比較して、より精度の高いモデルを選択する。なお、本実施形態では、閾値設定部62が判別器30が用いる閾値を更新する場合について説明する。ただし、評価部60が第一の実施形態と同様に閾値を更新してもよい。 Similar to the evaluation unit 60 of the first embodiment, the evaluation unit 60 evaluates the accuracy of the model learned by the learning device 50, and determines the accuracy of the existing model (the model before relearning) and the model after the relearning. Compare and select a model with higher accuracy. In the present embodiment, the case where the threshold setting unit 62 updates the threshold used by the discriminator 30 will be described. However, the evaluation unit 60 may update the threshold as in the first embodiment.
 記憶部63は、判別器30が種別を判別できない画像の割合(すなわち、判別不可比率)と閾値との対応関係を記憶する。対応関係は、個々の閾値と判別不可比率とが対応付けられていてもよく、閾値と判別不可比率との関係が数式で表されていてもよい。なお、一般に閾値が高くなるほど判別不可比率は増加することから、閾値と判別不可比率との関係を表す関数は、任意の単調非減少関数で表される。 The storage unit 63 stores the correspondence between the ratio of the images for which the classifier 30 cannot determine the type (that is, the ratio that cannot be determined) and the threshold value. In the correspondence relationship, each threshold value and the indistinguishable ratio may be associated with each other, and the relationship between the threshold value and the indistinguishable ratio may be expressed by a mathematical expression. In general, the higher the threshold value, the higher the non-discriminatory ratio. Therefore, the function representing the relationship between the threshold and the non-discriminable ratio is represented by an arbitrary monotonous non-decreasing function.
 また、上述するように、閾値には、種別全体で共通の値が用いられていてもよく、種別ごとに異なる値が用いられていてもよい。さらに、記憶部63は、画像種別推定部20が使用するモデルごとに、判別不可比率と閾値との対応関係を記憶してもよい。 Also, as described above, a common value may be used for the entire type as the threshold value, or a different value may be used for each type. Further, the storage unit 63 may store the correspondence between the indistinguishable ratio and the threshold for each model used by the image type estimation unit 20.
 図5は、閾値ごとに判別不可比率を対応付けた例を示す説明図である。図5に示す例では、画像種別推定部20が使用するモデルおよび種別ごとに、閾値と判別不可比率とを対応付けていることを示す。例えば、モデルαを用いて種別Aの確からしさを推定した場合に閾値80%と設定されていると、入力される画像のうち10%の推定確率で判別不可に判別されることを示す。 FIG. 5 is an explanatory diagram showing an example in which an indistinguishable ratio is associated with each threshold value. In the example illustrated in FIG. 5, the threshold value and the indistinguishable ratio are associated with each model and type used by the image type estimation unit 20. For example, when the probability of type A is estimated using the model α, if the threshold is set to 80%, it indicates that the input image is determined to be indistinguishable with an estimated probability of 10%.
 記憶部63には、過去の経験等に応じて判別不可比率と閾値との対応関係が記憶されてもよい。また、後述する閾値設定部62が、実際に入力される画像の判別結果から判別不可画像の割合を算出してもよい。閾値設定部62の動作については後述される。また、記憶部63は、例えば、磁気ディスク装置により実現される。 The storage unit 63 may store a correspondence relationship between the indistinguishable ratio and the threshold according to past experience and the like. Further, the threshold value setting unit 62 described later may calculate the ratio of non-discriminable images from the discrimination result of the actually input image. The operation of the threshold setting unit 62 will be described later. The storage unit 63 is realized by a magnetic disk device, for example.
 閾値設定部62は、判別器30が画像の種別を判別する際に用いる閾値を設定する。具体的には閾値設定部62は、判別器30が種別を判別できないと推定される画像(以下、判別不可推定画像と記す。)の数と、その判別不可推定画像の種別の判別に割り当て可能な人員の数とに応じて閾値を設定する。種別の判別に割り当て可能な人員の数は、判別の確認作業を行う人員の数と言うこともできる。具体的には、この人員によって判別不可推定画像への正解ラベルの設定が行われることになる。 The threshold setting unit 62 sets a threshold used when the discriminator 30 discriminates the type of image. Specifically, the threshold setting unit 62 can be assigned to determine the number of images (hereinafter, referred to as indistinguishable estimated images) estimated that the discriminator 30 cannot determine the type and the type of the indistinguishable estimated image. The threshold is set according to the number of personnel. It can be said that the number of personnel that can be assigned to the classification determination is the number of personnel that perform the identification confirmation work. Specifically, the correct label is set to the indistinguishable estimation image by this person.
 判別作業を行う期間T(時間)に対して投入できる人員の数をN(人)とする。また、1人の人員で単位期間に判別できる画像の枚数をW(時間・人/枚)とする。期間Tの間に入力される画像の枚数をX(枚)としたとき、判別不可推定画像の枚数Xn(枚)は、判別不可比率をPとすると、以下に例示する式1で表される。 Suppose N (persons) is the number of personnel that can be invested in the period T (hours) during which the discrimination work is performed. Further, the number of images that can be discriminated in one unit period by one person is assumed to be W (time / person / sheet). When the number of images input during the period T is X (sheets), the number Xn (sheets) of indistinguishable estimated images is expressed by the following formula 1, assuming that the indistinguishable ratio is P. .
 Xn=X・P (式1) Xn = X · P (Formula 1)
 また、期間Tの間に判別できる画像の枚数はT・N/Wであるため、判別不可推定画像を全て確認するには、以下に例示する式2の条件を満たす必要がある。 In addition, since the number of images that can be discriminated during the period T is T · N / W, in order to check all the indistinguishable estimation images, the following formula 2 must be satisfied.
 Xn≦T・N/W (式2) Xn ≦ T · N / W (Formula 2)
 したがって、閾値設定部62は、判別不可比率P≦T・N/W・Xnを満たす閾値を決定する。そして、閾値設定部62は、決定した閾値を判別器30に通知し、通知した閾値を設定させる。図5に示す例において、例えば、種別Aの判別不可比率が8%と算出されたとする。この場合、閾値設定部62は、判別不可比率8%を満たす閾値のうち、もっとも高い閾値である75%を設定する閾値として決定する。 Therefore, the threshold setting unit 62 determines a threshold that satisfies the indistinguishable ratio P ≦ T · N / W · Xn. Then, the threshold setting unit 62 notifies the discriminator 30 of the determined threshold, and sets the notified threshold. In the example illustrated in FIG. 5, for example, it is assumed that the classification impossible ratio of type A is calculated as 8%. In this case, the threshold value setting unit 62 determines the threshold value that sets the highest threshold value, 75%, among the threshold values that satisfy the indistinguishable ratio 8%.
 また、閾値設定部62は、実際に入力される画像の判別結果から設定する閾値ごとに判別不可比率を算出してもよい。具体的には、閾値設定部62は、ユーザから判別不可比率の算出指示を受け取ると、画像入力部10に閾値算出用の画像を入力し、閾値を変えながら、入力した画像の数に対する判別不可画像の数に基づいて判別不可比率を算出し、算出結果を記憶部63に登録する。閾値算出用の画像には、例えば、テスト用データ記憶部32に記憶された画像データが用いられてもよい。 Further, the threshold setting unit 62 may calculate a non-discrimination ratio for each threshold set from the discrimination result of the actually input image. Specifically, when the threshold setting unit 62 receives an instruction to calculate the indistinguishable ratio from the user, the threshold setting unit 62 inputs an image for threshold calculation to the image input unit 10 and cannot determine the number of input images while changing the threshold. An indistinguishable ratio is calculated based on the number of images, and the calculation result is registered in the storage unit 63. For example, image data stored in the test data storage unit 32 may be used as the threshold calculation image.
 また、閾値設定部62は、ユーザからの明示的な算出指示を受け取る場合以外にも、画像の種別の確からしさを推定する状況の変化(例えば、画像種別推定部20によって使用されるモデルが更新された状況)に応じて、閾値ごとの判別不可比率を算出しなおしてもよい。すなわち、閾値設定部62は、第一の実施形態で評価部60がモデルを更新するタイミングで閾値を更新する場合と同じように、評価部60がモデルを更新するタイミングで閾値ごとの判別不可比率を算出し直して記憶部63の情報を更新してもよい。 In addition to receiving an explicit calculation instruction from the user, the threshold setting unit 62 changes the situation for estimating the likelihood of the image type (for example, the model used by the image type estimation unit 20 is updated). Depending on the situation, the indistinguishable ratio for each threshold value may be recalculated. That is, the threshold setting unit 62 determines the non-discrimination ratio for each threshold at the timing when the evaluation unit 60 updates the model, in the same manner as when the evaluation unit 60 updates the model at the timing when the evaluation unit 60 updates the model in the first embodiment. May be recalculated and the information in the storage unit 63 may be updated.
 なお、画像入力部10と、画像種別推定部20と、判別器30と、種別設定部40と、学習器50と、評価部60と、閾値設定部62とは、プログラム(画像判定プログラム)に従って動作するコンピュータのCPUによって実現される。 The image input unit 10, the image type estimation unit 20, the discriminator 30, the type setting unit 40, the learning device 50, the evaluation unit 60, and the threshold setting unit 62 are according to a program (image determination program). It is realized by a CPU of an operating computer.
 次に、本実施形態の画像判定システムの動作を説明する。図6は、第二の実施形態の画像判定システムの動作例を示すフローチャートである。閾値設定部62は、判別不可推定画像の数と、その判別不可推定画像の種別の判別に割り当て可能な人員の数とに応じて判別器30の閾値を設定する(ステップS21)。画像種別推定部20が、入力される画像の種別の確からしさを推定する(ステップS22)。そして、判別器30は、設定された閾値に基づいて、推定された種別の確からしさがその閾値を超える画像の種別を判別する(ステップS23)。 Next, the operation of the image determination system of this embodiment will be described. FIG. 6 is a flowchart illustrating an operation example of the image determination system according to the second embodiment. The threshold value setting unit 62 sets the threshold value of the discriminator 30 according to the number of indistinguishable estimated images and the number of personnel that can be assigned to discriminate the type of the indistinguishable estimated images (step S21). The image type estimation unit 20 estimates the likelihood of the type of the input image (step S22). Then, the discriminator 30 discriminates the image type whose probability of the estimated type exceeds the threshold value based on the set threshold value (step S23).
 以上のように、本実施形態では、閾値設定部62が、判別不可推定画像の数とその判別不可推定画像の種別の判別に割り当て可能な人員の数とに応じて閾値を設定する。そして、判別器30が、画像種別推定部20によって推定される確からしさが、設定された閾値を超える画像の種別を判別する。 As described above, in the present embodiment, the threshold value setting unit 62 sets the threshold value according to the number of indistinguishable estimation images and the number of personnel that can be assigned to the discrimination of the indistinguishable estimation image type. Then, the discriminator 30 discriminates image types whose probability estimated by the image type estimation unit 20 exceeds the set threshold value.
 このような構成により、画像を判別する精度が低減することを抑制しつつ人員の状況に応じて画像を判別する基準を調整できる。そのため、種別の判別に割り当て可能な人員の範囲内で、判別できなかった画像(すなわち、判別不可画像)に対する判別処理を行うことが可能になる。 With such a configuration, it is possible to adjust the reference for discriminating the image according to the situation of the personnel while suppressing the reduction of the accuracy of discriminating the image. For this reason, it is possible to perform discrimination processing for an image that cannot be discriminated (that is, an image that cannot be discriminated) within the range of personnel that can be assigned to discriminate the type.
 次に、本発明の概要を説明する。図7は、本発明による画像判定システムの概要を示すブロック図である。本発明による画像判定システム90(例えば、画像判定システム200)は、入力される画像の種別の確からしさを推定する推定器(例えば、画像種別推定部20)によって出力される種別の確からしさが、設定された閾値を超える画像の種別を判別する判別部91(例えば、判別器30)と、閾値を設定する閾値設定部92(例えば、閾値設定部62)とを備えている。 Next, the outline of the present invention will be described. FIG. 7 is a block diagram showing an outline of an image determination system according to the present invention. The image determination system 90 (for example, the image determination system 200) according to the present invention has a likelihood of the type output by the estimator (for example, the image type estimation unit 20) that estimates the likelihood of the type of the input image. A discrimination unit 91 (eg, the discriminator 30) that discriminates the type of an image that exceeds the set threshold value, and a threshold setting unit 92 (eg, a threshold setting unit 62) that sets the threshold value are provided.
 閾値設定部92は、判別部91が種別を判別できないと推定される画像である判別不可推定画像の数と、その判別不可推定画像の種別の判別に割り当て可能な人員の数とに応じて閾値を設定する。 The threshold setting unit 92 determines the threshold according to the number of indistinguishable estimated images that are estimated to be the type that cannot be discriminated by the discriminating unit 91 and the number of personnel that can be assigned to discriminate the type of the indistinguishable estimated image. Set.
 そのような構成により、画像を判別する精度が低減することを抑制しつつ人員の状況に応じて画像を判別する基準を調整できる。 With such a configuration, it is possible to adjust the reference for discriminating the image according to the situation of the personnel while suppressing the reduction of the accuracy of discriminating the image.
 具体的には、画像判定システム90は、判別部91が種別を判別できない画像の割合である判別不可比率と閾値との対応関係を記憶する記憶部(例えば、記憶部63)を備えていてもよい。そして、閾値設定部92は、判別対象の画像の数と判別不可比率とから判別不可推定画像の数を閾値ごとに算出し、算出された数の判別不可推定画像の判別に割り当て可能な人員に応じて閾値を設定してもよい。 Specifically, the image determination system 90 may include a storage unit (for example, the storage unit 63) that stores the correspondence relationship between the determination impossible ratio, which is the ratio of images for which the determination unit 91 cannot determine the type, and the threshold value. Good. Then, the threshold setting unit 92 calculates the number of indistinguishable estimated images for each threshold from the number of images to be discriminated and the indistinguishable ratio, and assigns the calculated number to the personnel that can be assigned to discriminate the indistinguishable estimated images. A threshold value may be set accordingly.
 その際、閾値設定部92は、割り当て可能な人員によって、算出された数の判別不可推定画像の判別が可能な閾値のうち、最も高い閾値を設定してもよい。 At that time, the threshold value setting unit 92 may set the highest threshold value among the threshold values that can be determined by the assignable personnel for the calculated number of indistinguishable estimation images.
 また、閾値設定部92は、入力した画像の数に対する判別不可画像の数に基づいて閾値を変えながら判別不可比率を算出し、算出結果を記憶部に登録してもよい。 Further, the threshold setting unit 92 may calculate the indistinguishable ratio while changing the threshold based on the number of indistinguishable images with respect to the number of input images, and may register the calculation result in the storage unit.
 また、閾値設定部92は、画像の種別の確からしさを推定する状況の変化(例えば、画像種別推定部20が用いるモデルの変化)に応じて閾値ごとの判別不可比率を算出してもよい。 Further, the threshold value setting unit 92 may calculate an indistinguishable ratio for each threshold value according to a change in the situation for estimating the likelihood of the image type (for example, a change in the model used by the image type estimation unit 20).
 以上、実施形態及び実施例を参照して本願発明を説明したが、本願発明は上記実施形態および実施例に限定されるものではない。本願発明の構成や詳細には、本願発明のスコープ内で当業者が理解し得る様々な変更をすることができる。 As mentioned above, although this invention was demonstrated with reference to embodiment and an Example, this invention is not limited to the said embodiment and Example. Various changes that can be understood by those skilled in the art can be made to the configuration and details of the present invention within the scope of the present invention.
 この出願は、2018年5月8日に出願された日本特許出願2018-089928を基礎とする優先権を主張し、その開示の全てをここに取り込む。 This application claims priority based on Japanese Patent Application No. 2018-089928 filed on May 8, 2018, the entire disclosure of which is incorporated herein.
 10 画像入力部
 11 画像記憶器
 20 画像種別推定部
 21 AI
 30 判別器
 31 再学習用データ記憶部
 32 テスト用データ記憶部
 40 種別設定部
 50 学習器
 51 AI
 60 評価部
 61 評価器
 62 閾値設定部
 63 記憶部
 70 出力部
 Pa~Pb 処理器
 100,200 画像判定システム
DESCRIPTION OF SYMBOLS 10 Image input part 11 Image memory | storage device 20 Image classification estimation part 21 AI
30 Classifier 31 Re-learning Data Storage Unit 32 Test Data Storage Unit 40 Type Setting Unit 50 Learning Unit 51 AI
60 Evaluator 61 Evaluator 62 Threshold Setting Unit 63 Storage Unit 70 Output Unit Pa to Pb Processor 100, 200 Image Determination System

Claims (9)

  1.  入力される画像の種別の確からしさを推定する推定器によって出力される当該種別の確からしさが、設定された閾値を超える画像の種別を判別する判別部と、
     前記閾値を設定する閾値設定部とを備え、
     前記閾値設定部は、前記判別部が種別を判別できないと推定される画像である判別不可推定画像の数と、当該判別不可推定画像の種別の判別に割り当て可能な人員の数とに応じて前記閾値を設定する
     ことを特徴とする画像判定システム。
    A discriminator for discriminating a type of an image in which the likelihood of the type output by the estimator that estimates the likelihood of the type of the input image exceeds a set threshold;
    A threshold setting unit for setting the threshold,
    The threshold value setting unit is based on the number of indistinguishable estimation images that are estimated to be the type that the discrimination unit cannot discriminate and the number of personnel that can be assigned to discriminate the type of the indistinguishable estimation image. An image determination system characterized by setting a threshold value.
  2.  判別部が種別を判別できない画像の割合である判別不可比率と閾値との対応関係を記憶する記憶部を備え、
     閾値設定部は、判別対象の画像の数と判別不可比率とから判別不可推定画像の数を閾値ごとに算出し、算出された数の判別不可推定画像の判別に割り当て可能な人員に応じて閾値を設定する
     請求項1記載の画像判定システム。
    A storage unit that stores a correspondence relationship between a threshold value and a non-discrimination ratio that is a ratio of an image for which the discrimination unit cannot discriminate;
    The threshold setting unit calculates the number of indistinguishable estimated images for each threshold from the number of images to be discriminated and the indistinguishable ratio, and determines the threshold according to the number of personnel that can be assigned to discriminate the calculated number of indistinguishable estimated images. The image determination system according to claim 1.
  3.  閾値設定部は、割り当て可能な人員によって、算出された数の判別不可推定画像の判別が可能な閾値のうち、最も高い閾値を設定する
     請求項2記載の画像判定システム。
    The image determination system according to claim 2, wherein the threshold value setting unit sets the highest threshold value among threshold values that can be determined by the assignable personnel for the calculated number of indistinguishable estimated images.
  4.  閾値設定部は、入力した画像の数に対する判別不可画像の数に基づいて閾値を変えながら判別不可比率を算出し、算出結果を記憶部に登録する
     請求項2または請求項3記載の画像判定システム。
    4. The image determination system according to claim 2, wherein the threshold setting unit calculates an indistinguishable ratio while changing the threshold based on the number of indistinguishable images with respect to the number of input images, and registers the calculation result in the storage unit. .
  5.  閾値設定部は、画像の種別の確からしさを推定する状況の変化に応じて閾値ごとの判別不可比率を算出する
     請求項4記載の画像判定システム。
    The image determination system according to claim 4, wherein the threshold setting unit calculates an indistinguishable ratio for each threshold according to a change in a situation in which the probability of the image type is estimated.
  6.  入力される画像の種別の確からしさを推定する推定器によって出力される当該種別の確からしさが、設定された閾値を超える画像の種別を判別し、
     種別を判別できないと推定される画像である判別不可推定画像の数と、当該判別不可推定画像の種別の判別に割り当て可能な人員の数とに応じて前記閾値を設定する
     ことを特徴とする画像判定方法。
    Determining the type of image for which the likelihood of the type output by the estimator that estimates the likelihood of the type of the input image exceeds a set threshold;
    The threshold value is set according to the number of indistinguishable estimated images that are estimated to be unable to discriminate the type and the number of personnel that can be assigned to discriminate the type of the indistinguishable estimated image. Judgment method.
  7.  種別を判別できない画像の割合である判別不可比率と閾値との対応関係を記憶する記憶部を参照し、判別対象の画像の数と前記判別不可比率とから判別不可推定画像の数を前記閾値ごとに算出し、
     算出された数の判別不可推定画像の判別に割り当て可能な人員に応じて閾値を設定する
     請求項6記載の画像判定方法。
    With reference to a storage unit that stores a correspondence relationship between an indistinguishable ratio, which is a ratio of images whose type cannot be discriminated, and a threshold, the number of indistinguishable estimated images is determined for each threshold by using the number of images to be discriminated and the indistinguishable ratio To
    The image determination method according to claim 6, wherein a threshold value is set according to the number of personnel that can be assigned to determine the calculated number of indistinguishable estimated images.
  8.  コンピュータに、
     入力される画像の種別の確からしさを推定する推定器によって出力される当該種別の確からしさが、設定された閾値を超える画像の種別を判別する判別処理、および、
     前記閾値を設定する閾値設定処理を実行させ、
     前記閾値設定処理で、前記判別処理で種別を判別できないと推定される画像である判別不可推定画像の数と、当該判別不可推定画像の種別の判別に割り当て可能な人員の数とに応じて前記閾値を設定させる
     ための画像判定プログラム。
    On the computer,
    A discriminating process for discriminating the type of image in which the likelihood of the type output by the estimator that estimates the likelihood of the type of the input image exceeds a set threshold; and
    Executing threshold setting processing for setting the threshold;
    In the threshold value setting process, the number of indistinguishable estimation images that are estimated to be the type that cannot be discriminated in the discrimination process, and the number of personnel that can be assigned to discriminate the type of the indistinguishable estimation image An image judgment program for setting a threshold.
  9.  コンピュータに、
     閾値設定処理で、種別を判別できない画像の割合である判別不可比率と閾値との対応関係を記憶する記憶部を参照し、判別対象の画像の数と前記判別不可比率とから判別不可推定画像の数を前記閾値ごとに算出させ、算出された数の判別不可推定画像の判別に割り当て可能な人員に応じて閾値を設定させる
     請求項8記載の画像判定プログラム。
    On the computer,
    A threshold value setting process refers to a storage unit that stores a correspondence relationship between a threshold value that is a ratio of images whose type cannot be determined and a threshold value. The image determination program according to claim 8, wherein a number is calculated for each of the threshold values, and the threshold value is set according to the number of personnel that can be assigned to determine the calculated number of indistinguishable estimation images.
PCT/JP2019/006777 2018-05-08 2019-02-22 Image determination system, image determination method, and image determination program WO2019215994A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2020518152A JP6969678B2 (en) 2018-05-08 2019-02-22 Image judgment system, image judgment method and image judgment program

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2018089928 2018-05-08
JP2018-089928 2018-05-08

Publications (1)

Publication Number Publication Date
WO2019215994A1 true WO2019215994A1 (en) 2019-11-14

Family

ID=68467426

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2019/006777 WO2019215994A1 (en) 2018-05-08 2019-02-22 Image determination system, image determination method, and image determination program

Country Status (2)

Country Link
JP (1) JP6969678B2 (en)
WO (1) WO2019215994A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2021099778A (en) * 2019-12-20 2021-07-01 ユーアイパス, インコーポレイテッドUiPath, Inc. Dynamic update for artificial intelligence/machine learning model, or retrain and update, in digital processes at runtime
JP6950911B1 (en) * 2020-07-31 2021-10-13 株式会社アイデミ— Demand forecast program, demand forecast device, demand forecast method, demand forecast notification program, demand forecast notification device and demand forecast notification method

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007047981A (en) * 2005-08-09 2007-02-22 Toshiba Corp Production plan output program and production plan output device
JP2012190159A (en) * 2011-03-09 2012-10-04 Canon Inc Information processing device, information processing method, and program
JP2015122553A (en) * 2013-12-20 2015-07-02 キヤノン株式会社 Image processing apparatus, image processing method, and program
JP2016173682A (en) * 2015-03-16 2016-09-29 シャープ株式会社 Abnormality determination device and abnormality determination method
JP2016212812A (en) * 2015-05-14 2016-12-15 富士ゼロックス株式会社 Information processor and information processing program

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5750946B2 (en) * 2011-03-09 2015-07-22 株式会社リコー Content distribution system, content distribution server, content distribution method, program, and recording medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007047981A (en) * 2005-08-09 2007-02-22 Toshiba Corp Production plan output program and production plan output device
JP2012190159A (en) * 2011-03-09 2012-10-04 Canon Inc Information processing device, information processing method, and program
JP2015122553A (en) * 2013-12-20 2015-07-02 キヤノン株式会社 Image processing apparatus, image processing method, and program
JP2016173682A (en) * 2015-03-16 2016-09-29 シャープ株式会社 Abnormality determination device and abnormality determination method
JP2016212812A (en) * 2015-05-14 2016-12-15 富士ゼロックス株式会社 Information processor and information processing program

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2021099778A (en) * 2019-12-20 2021-07-01 ユーアイパス, インコーポレイテッドUiPath, Inc. Dynamic update for artificial intelligence/machine learning model, or retrain and update, in digital processes at runtime
JP7088989B2 (en) 2019-12-20 2022-06-21 ユーアイパス,インコーポレイテッド Dynamic update, or retraining and updating of artificial intelligence / machine learning models in run-time digital processes
US11822913B2 (en) 2019-12-20 2023-11-21 UiPath, Inc. Dynamic artificial intelligence / machine learning model update, or retrain and update, in digital processes at runtime
JP6950911B1 (en) * 2020-07-31 2021-10-13 株式会社アイデミ— Demand forecast program, demand forecast device, demand forecast method, demand forecast notification program, demand forecast notification device and demand forecast notification method
JP2022026878A (en) * 2020-07-31 2022-02-10 株式会社アイデミ― Demand prediction program, demand predicting device, demand prediction method, demand prediction notification program, demand prediction notification device, and demand prediction notification method

Also Published As

Publication number Publication date
JPWO2019215994A1 (en) 2020-12-17
JP6969678B2 (en) 2021-11-24

Similar Documents

Publication Publication Date Title
JP7130984B2 (en) Image judgment system, model update method and model update program
CN110674880B (en) Network training method, device, medium and electronic equipment for knowledge distillation
US11210513B2 (en) Detection method and detection device
US8325999B2 (en) Assisted face recognition tagging
JP2020123330A (en) Method for acquiring sample image for label acceptance inspection from among auto-labeled images utilized for neural network learning, and sample image acquisition device utilizing the same
CN110462607B (en) Identifying reason codes from gradient boosters
WO2019215994A1 (en) Image determination system, image determination method, and image determination program
US20200042883A1 (en) Dictionary learning device, dictionary learning method, data recognition method, and program storage medium
JP7062474B2 (en) Deterioration status identification device, deterioration status identification method and program
US8644625B2 (en) Object detection device, method and program
US20220343163A1 (en) Learning system, learning device, and learning method
JP6988995B2 (en) Image generator, image generator and image generator
US11010692B1 (en) Systems and methods for automatic extraction of classification training data
WO2022024211A1 (en) Learning utilization system, utilization device, learning device, program, and learning utilization method
KR101854804B1 (en) Apparatus for providing user authentication service and training data by determining the types of named entities associated with the given text
WO2023173543A1 (en) Data classification model training method and apparatus, classification method and apparatus, device, and medium
US20210241021A1 (en) Information processing method and information processing system
WO2020166321A1 (en) Model learning device, label estimation device, method for these, and program
CN110414845B (en) Risk assessment method and device for target transaction
CN115394295A (en) Segmentation processing method, device, equipment and storage medium
KR101864301B1 (en) Apparatus and method for classifying data
US20240005655A1 (en) Learning apparatus, estimation apparatus, learning method, estimation method and program
US20220261690A1 (en) Computer-readable recording medium storing determination processing program, determination processing method, and information processing apparatus
WO2023067669A1 (en) Learning device, learning method, and learning program
US20240087098A1 (en) Training method and training device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19800765

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2020518152

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19800765

Country of ref document: EP

Kind code of ref document: A1