WO2022054124A1 - 画像判定装置、画像判定方法及びプログラム - Google Patents

画像判定装置、画像判定方法及びプログラム Download PDF

Info

Publication number
WO2022054124A1
WO2022054124A1 PCT/JP2020/033921 JP2020033921W WO2022054124A1 WO 2022054124 A1 WO2022054124 A1 WO 2022054124A1 JP 2020033921 W JP2020033921 W JP 2020033921W WO 2022054124 A1 WO2022054124 A1 WO 2022054124A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
object data
training
learning model
machine learning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2020/033921
Other languages
English (en)
French (fr)
Japanese (ja)
Inventor
永男 蔡
プリーサム プラキャッシャー
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Rakuten Group Inc
Original Assignee
Rakuten Group Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Rakuten Group Inc filed Critical Rakuten Group Inc
Priority to PCT/JP2020/033921 priority Critical patent/WO2022054124A1/ja
Priority to US17/419,285 priority patent/US12067708B2/en
Priority to JP2022548264A priority patent/JP7401688B2/ja
Publication of WO2022054124A1 publication Critical patent/WO2022054124A1/ja
Anticipated expiration legal-status Critical
Priority to JP2023206941A priority patent/JP7588203B2/ja
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • G06T7/0004Industrial image inspection
    • G06T7/0006Industrial image inspection using a design-rule based approach
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/09Supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]

Definitions

  • the present invention relates to an image determination device, an image determination method and a program, and particularly to a technique for determining the substantial identity of a plurality of images.
  • a screen image of a computer application is manually designed on the assumption that it will be displayed in one selected execution environment such as a specific smartphone. Objects such as buttons, images, and input forms are placed on these screen images.
  • the screen image of substantially the same design is ported to other execution environments.
  • the plurality of screen images obtained in this way are often visually confirmed to be substantially identical.
  • the number of execution environments for computer applications increases, there is an increasing need to automate such verification tasks.
  • the screen size, aspect ratio, and resolution differ depending on the execution environment of the computer application.
  • the appearance of objects such as buttons provided by the execution environment such as the operation system and included in each screen image is not a little different. Therefore, even if a plurality of screen images are compared on a pixel-by-pixel basis, it is difficult to confirm their substantial identities. Further, it is conceivable to determine the substantial identity of the screen images by inputting the screen images into the machine learning model, but there is a concern that the learning amount becomes enormous.
  • the present invention has been made in view of the above problems, and an object thereof is to provide an image determination device, an image determination method, and a program capable of easily and correctly determining the substantial identity of a plurality of images in which objects are arranged. To do.
  • the image determination device uses a first machine learning model in which an image is input and object data indicating the attributes and arrangement of the objects in the image is output.
  • the first object data indicating the attribute and arrangement of the object in the first image is acquired from the first image
  • the second object data indicating the attribute and arrangement of the object in the second image is acquired from the second image.
  • the first object data acquisition means and the second machine learning model in which the first object data and the second object data are input and the substantial identity of the first image and the second image are output are used.
  • a determination means for determining the substantial identity of one image and the second image is included.
  • the first machine learning model may include R-CNN.
  • the first machine learning model may be trained by a training image generated by superimposing a part or all of a plurality of objects on a given base image.
  • the second machine learning model may include a fully connected layer.
  • the second machine learning model may include a convolution layer and a pooling layer on the upstream side of the fully connected layer, which reduces the number of dimensions of the first object data and the input data based on the second object data.
  • the second machine learning model is a first training image and a second trained image generated by superimposing a given object on each of the same or similar first and second base images according to a predetermined placement rule. Learned from the first learning object data showing the attributes and arrangement of the objects in the first training image and the second learning object data showing the attributes and arrangement of the objects in the second training image obtained from the training image. good.
  • the step of acquiring the first object data indicating the attributes and arrangement of the objects in the first image from the first image by using the first machine learning model a step of acquiring second object data indicating the attributes and arrangement of objects in the second image from the second image, and using the second machine learning model, A step of determining the substantial identity of the first image and the second image from the first object data and the second object data is included.
  • the first machine learning model may include R-CNN.
  • the above method may further include a step of performing training of the first machine learning model by a training image generated by superimposing a part or all of a plurality of objects on a given base image.
  • the second machine learning model may include a fully connected layer.
  • the second machine learning model may include a convolution layer and a pooling layer on the upstream side of the fully connected layer, which reduces the number of dimensions of the first object data and the input data based on the second object data. ..
  • the above method includes a step of generating a first training image and a second training image by superimposing a given object on each of the same or similar first and second base images according to a predetermined arrangement rule.
  • the first learning object data showing the attributes and arrangement of the objects in the first training image
  • the second learning object data By inputting each of the first training image and the second training image into the first machine learning model, the first learning object data showing the attributes and arrangement of the objects in the first training image, and the second learning object data.
  • a step of acquiring the second learning object data indicating the attributes and arrangement of the objects in the training image, a step of executing the learning of the second machine learning model by the first learning object data and the second learning object data, and a step of executing the learning. May further be included.
  • the program according to still another aspect of the present invention includes a step of acquiring first object data indicating the attributes and arrangement of objects in the first image from the first image by using the first machine learning model. Using the first machine learning model, the step of acquiring the second object data indicating the attributes and arrangement of the objects in the second image from the second image, and the second machine learning model are used. It is a program for causing a computer to execute a step of determining the substantial identity of the first image and the second image from the first object data and the second object data.
  • This program may be stored in a computer-readable information storage medium such as a magneto-optical disk or a semiconductor memory.
  • FIG. 1 is a functional block diagram of an image determination device according to an embodiment of the present invention.
  • Each functional block shown in the figure is realized by executing the image determination program according to the embodiment of the present invention on a computer configured mainly by a CPU and a memory.
  • the image determination device 10 shown in the figure determines the substantial identity of the two screen images.
  • Objects such as buttons are arranged on each screen image as described above.
  • the image determination device 10 determines the substantial identity of the two screen images based on the number of objects included in the two screen images, the attributes of each object, and the arrangement of each object in the screen image.
  • the attributes of the object are, for example, the type and color information of the object.
  • Types of objects include buttons, logo images, trademark images, input forms, and the like.
  • As the color information of the object information on one or more representative colors of the object, information on the average color, and the like may be adopted.
  • substantially identical to the two images means that the number of object images included in each image, the attributes and arrangement of each image match the predefined regular rules. Further, “there is no substantial identity” between the two images means that the number of object images included in each image, their respective attributes and arrangements meet the predefined negative rule.
  • the same is true.
  • the screen image in which only the object A is arranged at the same position and the object B is lost is not substantially the same as the screen image shown in the figure (a). Will be done. That is, it is determined that the screen images in which the number of objects does not match are not substantially the same.
  • a screen image in which the object A is arranged at the same position but the object B is arranged shifted in the horizontal direction is also a screen image shown in the figure (a). It is judged that there is no substantial identity with. That is, it is determined that the screen image in which the corresponding objects are arranged so as to be displaced in the horizontal direction is not substantially the same as the screen image shown in FIG.
  • a screen image having a different size of the object B as shown in the figure (d) and a screen image having a different size of the object A as shown in the figure (e) are shown in the figure (a). It is judged that there is substantial identity with the screen image shown. That is, when comparing two screen images, if the number, attributes, and arrangement of the objects are the same, it is determined that the two screen images are substantially the same even if the sizes of the objects are different.
  • the image determination device 10 has a first machine learning model, R-CNN (Regions with Convolutional Neural Networks) 12, and a second machine learning model. It is configured to include CNN (Convolutional Neural Networks) 14 which is a machine learning model.
  • CNN Convolutional Neural Networks
  • the object data A indicating the attributes and arrangement of the objects included in the screen image A is output.
  • the object data B indicating the attributes and arrangement of the objects included in the screen image B is output.
  • object data A and object data B are input to CNN14.
  • the CNN 14 outputs whether or not the screen image A and the screen image B have substantial identity based on the object data A and the object data B.
  • the screen image A and the screen image B are sequentially input to the R-CNN12 so that the object data A and the object data B are sequentially obtained.
  • the two R-CNN12s are arranged in parallel.
  • the object data B may be generated from the screen image B in parallel with the object data A being generated from the screen image A by providing the object data A in front of the CNN 14.
  • the R-CNN12 is a machine learning model for object detection, and is a known machine learning model for object detection such as the original R-CNN, Fast R-CNN, Faster R-CNN, and MASK R-CNN. May be configured by.
  • Faster R-CNN is adopted as an example of R-CNN12.
  • object data indicating the arrangement in the input image and the attributes of the object is output for each of the objects included in the image.
  • the placement of each object in the input image is specified in the object data, for example by an anchor box.
  • the CNN 14 includes a data integration unit 15, a multi-stage dimension reduction unit 16, a one-dimensional unit 17, and a multi-stage fully connected layer 18, which are composed of a convolution layer and a pooling layer, respectively. Includes.
  • the data integration unit 15 concatenates the object data A and the object data B output from the R-CNN 12 to generate the input data of 1.
  • the object data A and the object data B have one dimension indicating the order of the objects (for example, 300 data may be included) and two dimensions indicating the size and arrangement of the objects (for example, four in each dimension). It may be a four-dimensional tensor data consisting of one dimension (for example, 1024 data may be included) indicating the attributes of the object.
  • the data integration unit 15 may combine the object data A and the object B to obtain input data by, for example, extending the dimension of the ordinal number of the object to twice the size.
  • the objects included in the screen image A are assigned to the ordinal numbers 1 to 300 of the objects
  • the objects included in the screen image B are assigned to the ordinal numbers 301 to 600 of the objects.
  • the input data generated by the data integration unit 15 is dimensionally reduced by the dimension reduction unit 16 in a plurality of stages, and the two-dimensional intermediate data is output from the dimension reduction unit 16 in the final stage.
  • the one-dimensional unit 17 makes the intermediate data one-dimensional, and inputs the one-dimensional intermediate data to the fully connected layer 18 of the first stage.
  • the fully connected layer 18 in the final stage outputs a one-dimensional (may include two data) identity determination results from the one-dimensional intermediate data.
  • the identity determination result includes data showing the degree of identity of the screen image A and the screen image B to some extent, and data showing the degree to which there is no such identity.
  • CNN 14 shown in FIG. 4 it becomes possible to determine whether or not the screen image A and the screen image B are substantially identical from the attributes and arrangements of the objects included in each of the screen image A and the screen image B.
  • the object data A and the object data B are combined to obtain the input data, and the dimension reduction of the input data is performed.
  • the dimension reduction is performed for each of the object data A and the object data B. You may try to do it. That is, as shown in CNN 14 shown in FIG. 5, the dimension reduction unit 16a having a plurality of stages including the folding layer and the pooling layer reduces the dimension of the object data A, and the dimension reduction unit has the same function as the dimension reduction unit 16a. With 16b, the dimension reduction of the object data B may be performed in parallel with the dimension reduction of the object data A.
  • the outputs of the dimension reduction units 16a and 16b are two-dimensional intermediate data, respectively, and the data integration unit 19 converts each intermediate data into one dimension, concatenates them, and outputs one one-dimensional intermediate data.
  • the intermediate data output from the data integration unit 19 is input to the fully connected layer 18 of a plurality of stages, and the identity determination result is output from the final stage.
  • FIG. 6 is a diagram showing a configuration of a learning device 20 that learns the R-CNN 12.
  • Each functional block shown in the figure is realized by executing a program according to an embodiment of the present invention on a computer configured around a CPU and a memory.
  • the learning device 20 generates training data for generating training data used for learning of the training unit 21, R-CNN12, which executes learning of R-CNN12 by updating the internal parameters of R-CNN12.
  • the unit 22 includes a base image storage unit 24 for storing a base image used when generating training data, and an object image storage unit 26 for storing an object image used when similarly generating training data.
  • FIG. 7 shows an example of a base image stored in the base image storage unit 24.
  • the base image is image data in raster format which is the basis of the screen image
  • FIG. 6A shows a base image in which sentences are represented on the entire surface.
  • FIG. 3B shows a plain white base image.
  • FIG. C shows a base image consisting of a solid colored page top and a bottom showing the document.
  • FIG. 3D shows a base image divided into a plurality of blocks and a document is represented in each block.
  • the base image storage unit 24 stores a large number of such raster format base images.
  • FIG. 8 shows an example of an object image stored in the object image storage unit 26.
  • the object image is image data in raster format which is the basis of the screen image, and shows the appearance of an object such as a button or an input form arranged in the screen image.
  • the figure (a) shows a logo image which is a kind of object image. An object number "001" is assigned to the object image shown in the figure.
  • FIG. 3B shows an input form image which is a kind of object image.
  • An object number "002" is assigned to the object image shown in the figure.
  • the figure (c) shows a button image which is a kind of object image. An object number "003" is assigned to the object image shown in the figure.
  • the figure (d) shows a trademark image which is a kind of object image. An object number "004" is assigned to the object image shown in the figure.
  • the object image storage unit 26 stores a large number of such raster-format object images.
  • the object image storage unit 26 stores the object attribute table shown in FIG. 9 in addition to a large number of object images.
  • the object attribute table stores the attributes of each object image stored in the object image storage unit 26. Specifically, the object attribute table stores the type and color information of the object image related to each object number in association with each object number. For example, in the object attribute table, the type "logo" of the object image and the color information "red / white” of the object image are stored in association with the object number "001" of the object image shown in FIG. 8 (a). ing.
  • the training data generation unit 22 generates a large number of training data based on the data stored in the base image storage unit 24 and the object image storage unit 26.
  • the training data includes training images and correct answer data.
  • FIG. 10 shows an example of a training image generated by the training data generation unit 22.
  • the training image shown in FIG. 7A is the base image shown in FIG. 7B, the object image shown in FIG. 8A, the object image shown in FIG. 8D, and FIG. 8B. ), It is generated by superimposing the object image. Further, in the training image shown in FIG. 10 (b), the object image shown in FIG. 8 (b) and the object image shown in FIG. 8 (c) are superimposed on the base image shown in FIG. 7 (a). It was generated by this.
  • the base image used when generating the training image is randomly selected by the training data generation unit 22 from a large number of base images stored in the base image storage unit 24. Further, the object image to be superimposed on the base image is also randomly selected by the training data generation unit 22 from a large number of base images stored in the object image storage unit 24. Further, the arrangement (position and size) of each object image is also randomly determined by the training data generation unit 22.
  • FIG. 11 shows an example of correct answer data generated by the training data generation unit 22.
  • FIG. 10A shows correct answer data for the training image shown in FIG. 10A.
  • This correct answer data includes three object images in the training image, the arrangement of the first object image is "(XA1, YA1)-(XB1, YB1)", and the attribute is "logo, red”. -It is "white", the arrangement of the second object image is "(XA2, YA2)-(XB2, YB2)", the attribute is "mark, yellow”, and the arrangement of the third object image is It is indicated that "(XA3, YA3)-(XB3, YB3)" and the attribute is "form, white / black”. That is, the correct answer data indicates the number of object images included in the training image, and the arrangement and attributes of each object image.
  • FIG. 11 (b) shows the correct answer data for the training image shown in FIG. 10 (b).
  • the training data generation unit 22 When the training data generation unit 22 generates the training image, the object image to be superimposed on the base image is randomly selected, and the arrangement thereof is also randomly determined.
  • the training data generation unit 22 reads the attributes of the selected object image from the object attribute table shown in FIG. 9 and includes them in the correct answer data shown in FIG. 11, and also shows the arrangement of the determined object image in FIG. By including it in the correct answer data, the correct answer data corresponding to the training image is generated.
  • the training unit 21 executes the learning process of the R-CNN 12 using the training data generated by the training data generation unit 22. Specifically, the training images included in the training data are sequentially input to the R-CNN12, and the output of the object data is obtained. The difference between this output and the correct answer data included in the training data is calculated, and the internal parameters of R-CNN12 are updated so that this difference becomes small.
  • FIG. 12 is an operation flow diagram of the learning device 20.
  • the training data generation unit 22 randomly selects an object image stored in the object image storage unit 26 (S101).
  • the number of object images to be selected is also randomly determined.
  • the training data generation unit 22 randomly selects one base image from the base image storage unit 24 (S102).
  • the training data generation unit 22 generates a training image by arranging the selected object image with respect to the selected base image (S103). At this time, the training data generation unit 22 randomly determines the arrangement and size of each object image.
  • the training data generation unit 22 further generates the correct answer data exemplified in FIG. 11 according to the processing contents in S101 to S103, and includes it in the training data together with the training image generated in S103 (S104).
  • the training unit 21 executes the learning process of R-CNN12 using the generated training data (S106).
  • FIG. 13 is a diagram showing a configuration of a learning device 30 that learns the CNN 14.
  • the learning device 30 includes a training unit 31 that executes learning of the CNN 14 by updating the internal parameters of the CNN 14, a training data generation unit 32 that generates training data used for learning the CNN 14, and a base image storage.
  • a unit 24 and an object image storage unit 26 are included.
  • the training data generation unit 32 generates training data related to positive and negative examples by using the base image stored in the base image storage unit 24 and the object image stored in the object image storage unit 26.
  • Each training data includes a positive example, that is, a pair of training images that have substantial identity, or a negative example, that is, a pair of training images that do not have substantial identity.
  • FIG. 14 is a diagram showing training images related to positive and negative examples used for learning CNN14.
  • Figure (a) shows one example of a pair of training images.
  • the one training image is generated by arranging a plurality of randomly selected object images at random positions and at random sizes with respect to a randomly selected base image.
  • the object images A and B are arranged with respect to the base image in which the document is represented as a whole.
  • the figure (b) shows the other training image generated by applying the first negative rule.
  • the first negative rule a part of the object image included in one training image is removed.
  • the training image of the figure (b) does not include the object image B unlike the training image of the figure (a).
  • the figure (c) shows the other training image generated by applying the second negative rule.
  • the second negative rule a part of the object image included in one training image is moved horizontally.
  • the object image B is shifted to the left as compared with the training image of the figure (a).
  • the figure (d) shows the other training image generated by applying the regular rule.
  • a part of the object image included in one training image is enlarged or reduced.
  • the object image A is enlarged as compared with the training image of the figure (a).
  • the training unit 31 executes the learning process of CNN14 using the training data generated in this way. Specifically, the two training images included in the training data are sequentially input to the R-CNN12, and the two object data are acquired. Then, those object data are input to CNN14.
  • the training unit 31 acquires the identity determination result output from the CNN 14, and updates the internal parameters of the CNN 14 so that the identity determination result is correct. That is, when the training image according to the positive example is input to the R-CNN12, the internal parameters are updated so that the identity determination results output from the CNN 14 are substantially the same. On the contrary, when the training image according to the negative example is input to the R-CNN12, the internal parameters are updated so that the identity determination results output from the CNN 14 are not substantially the same.
  • FIG. 15 is an operation flow diagram of the learning device 30.
  • the training data generation unit 32 first randomly selects an object image from the object image storage unit 26 (S201). Further, the training data generation unit 32 randomly selects a base image from the base image storage unit 24 (S202). Then, the training data generation unit 32 generates the training image A by superimposing each object image on the selected base image (S203). At this time, the position and size of each object image are randomly determined.
  • the training data generation unit 32 applies the regular rule or the negative rule to superimpose all or a part of the selected object image on the selected base image to obtain the training image B.
  • Generate S204.
  • the training image B is generated without superimposing a part of the selected object images on the selected base image.
  • the training image B is made by moving a part of the selected object image to the right or left and then superimposing it on the selected base image.
  • the training image B is generated by enlarging or reducing a part of the selected object image and then superimposing it on the selected base image.
  • the training data generation unit 32 generates and stores training data including the generated pair of training images A and B and whether they are positive or negative examples (S205). ..
  • the training unit 31 executes the learning process of CNN14 using the generated training data (S207).
  • object data indicating the attributes and positions of the object images included in the two screen images to be compared can be obtained from each of the two screen images to be compared. Then, based on the two object data, it is determined whether or not the two screen images are substantially the same.
  • the CNN14 is pre-trained with a large number of training image pairs generated according to the regular and negative rules for determining the presence or absence of substantial identity. According to this embodiment, the substantial identity between the two screen images can be suitably determined.
  • the present inventors used 500 base images and 33 types of object images to generate 5000 training data for R-CNN12 and 8000 training data for CNN14, and R. -When the learning of CNN12 and CNN14 was carried out, the determination accuracy of the substantial identity of the screen image was about 86%, and it was found that it was sufficiently practical.
  • the scope of the present invention is not limited to the above embodiment, and includes various modifications.
  • the present invention can be similarly applied not only to screen images but also to various images such as page images of electronic books and web content images.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Databases & Information Systems (AREA)
  • Multimedia (AREA)
  • Medical Informatics (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Quality & Reliability (AREA)
  • Image Analysis (AREA)
PCT/JP2020/033921 2020-09-08 2020-09-08 画像判定装置、画像判定方法及びプログラム Ceased WO2022054124A1 (ja)

Priority Applications (4)

Application Number Priority Date Filing Date Title
PCT/JP2020/033921 WO2022054124A1 (ja) 2020-09-08 2020-09-08 画像判定装置、画像判定方法及びプログラム
US17/419,285 US12067708B2 (en) 2020-09-08 2020-09-08 Image judgement apparatus, image judgement method and non-transitory computer readable medium
JP2022548264A JP7401688B2 (ja) 2020-09-08 2020-09-08 画像判定装置、画像判定方法及びプログラム
JP2023206941A JP7588203B2 (ja) 2020-09-08 2023-12-07 画像判定装置、画像判定方法及びプログラム

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2020/033921 WO2022054124A1 (ja) 2020-09-08 2020-09-08 画像判定装置、画像判定方法及びプログラム

Publications (1)

Publication Number Publication Date
WO2022054124A1 true WO2022054124A1 (ja) 2022-03-17

Family

ID=80631391

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2020/033921 Ceased WO2022054124A1 (ja) 2020-09-08 2020-09-08 画像判定装置、画像判定方法及びプログラム

Country Status (3)

Country Link
US (1) US12067708B2 (https=)
JP (2) JP7401688B2 (https=)
WO (1) WO2022054124A1 (https=)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2024039974A (ja) * 2022-09-12 2024-03-25 日本電気株式会社 レイアウト支援システム、レイアウト支援装置、情報蓄積装置、レイアウト支援方法、およびレイアウト支援プログラム

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2018169690A (ja) * 2017-03-29 2018-11-01 日本電信電話株式会社 画像処理装置、画像処理方法及び画像処理プログラム
US20190073560A1 (en) * 2017-09-01 2019-03-07 Sri International Machine learning system for generating classification data and part localization data for objects depicted in images
JP2019219766A (ja) * 2018-06-15 2019-12-26 株式会社Lixil 分析装置、分析システム、及び分析プログラム
JP2020088416A (ja) * 2018-11-15 2020-06-04 シャープ株式会社 電子機器および制御装置
JP2020107185A (ja) * 2018-12-28 2020-07-09 Kddi株式会社 画像認識装置、画像認識方法及びプログラム

Family Cites Families (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW200746037A (en) * 2006-02-14 2007-12-16 Sony Corp Display control device and display control method
JP4696099B2 (ja) * 2007-08-07 2011-06-08 日立オムロンターミナルソリューションズ株式会社 表示画像変換装置
US8315423B1 (en) * 2007-12-28 2012-11-20 Google Inc. Providing information in an image-based information retrieval system
JP5485781B2 (ja) * 2010-05-10 2014-05-07 株式会社ザクティ 電子カメラ
US9002109B2 (en) * 2012-10-09 2015-04-07 Google Inc. Color correction based on multiple images
JP2016218977A (ja) * 2015-05-26 2016-12-22 富士通株式会社 比較プログラム、情報処理装置および比較方法
US9767381B2 (en) * 2015-09-22 2017-09-19 Xerox Corporation Similarity-based detection of prominent objects using deep CNN pooling layers as features
WO2017210908A1 (zh) * 2016-06-08 2017-12-14 华为技术有限公司 处理方法与终端
JP6832504B2 (ja) * 2016-08-08 2021-02-24 パナソニックIpマネジメント株式会社 物体追跡方法、物体追跡装置およびプログラム
US11308350B2 (en) * 2016-11-07 2022-04-19 Qualcomm Incorporated Deep cross-correlation learning for object tracking
US10210627B1 (en) * 2017-01-23 2019-02-19 Amazon Technologies, Inc. Image processing system for determining metrics of objects represented therein
US20180268307A1 (en) * 2017-03-17 2018-09-20 Yahoo Japan Corporation Analysis device, analysis method, and computer readable storage medium
CN107065247A (zh) * 2017-06-07 2017-08-18 惠科股份有限公司 一种显示面板检测方法、装置及系统
US10794710B1 (en) * 2017-09-08 2020-10-06 Perceptin Shenzhen Limited High-precision multi-layer visual and semantic map by autonomous units
US10902615B2 (en) * 2017-11-13 2021-01-26 Qualcomm Incorporated Hybrid and self-aware long-term object tracking
US10885336B1 (en) * 2018-01-13 2021-01-05 Digimarc Corporation Object identification and device communication through image and audio signals
US10489126B2 (en) * 2018-02-12 2019-11-26 Oracle International Corporation Automated code generation
US11100400B2 (en) * 2018-02-15 2021-08-24 Adobe Inc. Generating visually-aware item recommendations using a personalized preference ranking network
CN110580486B (zh) * 2018-06-07 2024-04-12 斑马智行网络(香港)有限公司 一种数据处理方法、装置、电子设备和可读介质
WO2020070943A1 (ja) * 2018-10-03 2020-04-09 株式会社Nttドコモ パターン認識装置及び学習済みモデル
US11354791B2 (en) * 2018-12-19 2022-06-07 General Electric Company Methods and system for transforming medical images into different styled images with deep neural networks
CN109635141B (zh) * 2019-01-29 2021-04-27 京东方科技集团股份有限公司 用于检索图像的方法、电子设备和计算机可读存储介质
AU2019200976A1 (en) * 2019-02-12 2020-08-27 Canon Kabushiki Kaisha Method, system and apparatus for generating training samples for matching objects in a sequence of images
WO2020181066A1 (en) * 2019-03-06 2020-09-10 Trax Technology Solutions Pte Ltd. Methods and systems for monitoring products
US11551348B2 (en) * 2019-04-09 2023-01-10 KLA Corp. Learnable defect detection for semiconductor applications
US10607331B1 (en) * 2019-06-28 2020-03-31 Corning Incorporated Image segmentation into overlapping tiles
US10748650B1 (en) * 2019-07-17 2020-08-18 Richard Ricci Machine learning of dental images for E-commerce
CN112446832A (zh) * 2019-08-31 2021-03-05 华为技术有限公司 一种图像处理方法及电子设备
US11513670B2 (en) * 2020-04-27 2022-11-29 Automation Anywhere, Inc. Learning user interface controls via incremental data synthesis
US20210374947A1 (en) * 2020-05-26 2021-12-02 Nvidia Corporation Contextual image translation using neural networks
US12111646B2 (en) * 2020-08-03 2024-10-08 Automation Anywhere, Inc. Robotic process automation with resilient playback of recordings
KR102430029B1 (ko) * 2020-08-26 2022-08-05 엔에이치엔클라우드 주식회사 딥러닝 기반 유사상품 검색 결과 제공 방법 및 그 시스템

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2018169690A (ja) * 2017-03-29 2018-11-01 日本電信電話株式会社 画像処理装置、画像処理方法及び画像処理プログラム
US20190073560A1 (en) * 2017-09-01 2019-03-07 Sri International Machine learning system for generating classification data and part localization data for objects depicted in images
JP2019219766A (ja) * 2018-06-15 2019-12-26 株式会社Lixil 分析装置、分析システム、及び分析プログラム
JP2020088416A (ja) * 2018-11-15 2020-06-04 シャープ株式会社 電子機器および制御装置
JP2020107185A (ja) * 2018-12-28 2020-07-09 Kddi株式会社 画像認識装置、画像認識方法及びプログラム

Also Published As

Publication number Publication date
JP7588203B2 (ja) 2024-11-21
JPWO2022054124A1 (https=) 2022-03-17
US20220309648A1 (en) 2022-09-29
JP2024015273A (ja) 2024-02-01
JP7401688B2 (ja) 2023-12-19
US12067708B2 (en) 2024-08-20

Similar Documents

Publication Publication Date Title
CN108352083B (zh) 用于拉伸成3d对象的2d图像处理
CN106062824B (zh) 边缘检测装置和边缘检测方法
GB2559446A (en) Generating a three-dimensional model from a scanned object
CN112534443A (zh) 图像处理设备及其操作方法
JP2018190329A (ja) 画像処理装置、画像処理プログラム、画像処理システム
JP2015232869A (ja) 画像処理装置、画像処理方法及び画像処理プログラム
WO2017050083A1 (zh) 一种元件识别方法及装置
JP7588203B2 (ja) 画像判定装置、画像判定方法及びプログラム
CN106778881A (zh) 数字印花方法及装置
KR102810293B1 (ko) 입력 이미지로부터 전력선을 검출하는 인공신경망의 학습 방법
CN114820834A (zh) 一种效果处理方法、装置、设备及存储介质
CN113658280B (zh) 基于人工智能的数据增广方法、装置、设备和存储介质
KR20160066511A (ko) 마커 인식 장치, 마커 인식 방법 및 기록 매체에 저장된 마커 인식용 컴퓨터 프로그램
CN112927321B (zh) 基于神经网络的图像智能设计方法、装置、设备及存储介质
US20240311997A1 (en) Semiconductor image processing apparatus and semiconductor image processing method
US12277633B2 (en) Create editable vectorized image from bitmap image
JP7270314B1 (ja) 検査方法、検査システム、ニューラルネットワーク
WO2023032216A1 (ja) 点群補完装置、点群補完方法、及び点群補完プログラム
CN114390265B (zh) 投射区域的设定辅助方法、设定辅助系统以及记录介质
US20240127507A1 (en) Image processing method and image processing apparatus
KR20180061496A (ko) 초해상화 방법
WO2021182345A1 (ja) 学習データ作成装置、方法、プログラム、学習データ及び機械学習装置
CN114820874A (zh) 一种文字的阴影效果处理方法、装置、设备及存储介质
Calvo et al. OpenCV 3. x with Python By Example: Make the most of OpenCV and Python to build applications for object recognition and augmented reality
JP2010108285A (ja) 画像処理方法およびそのコンピュータプログラム

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20953192

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2022548264

Country of ref document: JP

Kind code of ref document: A

122 Ep: pct application non-entry in european phase

Ref document number: 20953192

Country of ref document: EP

Kind code of ref document: A1