WO2020000879A1 - Procédé et appareil de reconnaissance d'image - Google Patents

Procédé et appareil de reconnaissance d'image Download PDF

Info

Publication number
WO2020000879A1
WO2020000879A1 PCT/CN2018/116335 CN2018116335W WO2020000879A1 WO 2020000879 A1 WO2020000879 A1 WO 2020000879A1 CN 2018116335 W CN2018116335 W CN 2018116335W WO 2020000879 A1 WO2020000879 A1 WO 2020000879A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
identified
screenshot
recognition
recognition result
Prior art date
Application number
PCT/CN2018/116335
Other languages
English (en)
Chinese (zh)
Inventor
周恺卉
王长虎
Original Assignee
北京字节跳动网络技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京字节跳动网络技术有限公司 filed Critical 北京字节跳动网络技术有限公司
Publication of WO2020000879A1 publication Critical patent/WO2020000879A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures

Definitions

  • Embodiments of the present application relate to the field of computer technology, and in particular, to an image recognition method and device.
  • the embodiments of the present application provide an image recognition method and device.
  • an embodiment of the present application provides an image recognition method.
  • the method includes: acquiring an image to be identified; inputting the image to be identified into a pre-trained screen image recognition model to obtain a characterization for whether the image to be identified is a screen image Of the recognition result, wherein the screen capture image recognition model is used to characterize the correspondence between the image to be recognized and the recognition result; and in response to the recognition result indicating that the image to be recognized is a screenshot image, the image to be recognized is deleted.
  • the method before acquiring the image to be identified, includes: acquiring a target image; and capturing a preset region of the target image as the image to be identified.
  • the method before acquiring the image to be identified, includes: acquiring a frame sequence of the target video; and selecting a target frame in the frame sequence of the target video as the image to be identified.
  • the screenshot image recognition model is obtained by training in the following steps: obtaining a training sample set, where the training sample includes a sample image and label information used to characterize whether the sample image is a screenshot image; The sample image is used as input, and the label information corresponding to the input sample image is used as the desired output, and a screenshot image recognition model is trained.
  • the method further includes: in response to the recognition result indicating that the image to be recognized is not a screenshot image, performing text recognition on the image to be recognized to obtain the recognition result; determining whether the recognition result includes a preset text; and in response to determining the recognition result Contains preset text to delete images to be identified.
  • an embodiment of the present application provides an image recognition device, the device includes: an image to be identified acquisition unit configured to obtain the image to be identified; an identification unit configured to input the image to be identified into a pre-trained screenshot An image recognition model to obtain a recognition result used to characterize whether the image to be recognized is a screenshot image, wherein the screenshot image recognition model is used to represent the correspondence between the image to be recognized and the recognition result; a first deletion unit configured to respond to the recognition result
  • the image to be identified is a screenshot image, and the image to be identified is deleted.
  • the apparatus further includes: a pushing unit configured to indicate that the image to be identified is not a screenshot image in response to the recognition result, and to push information for indicating that the image to be identified is not a screenshot image.
  • the apparatus further includes: a target image acquisition unit configured to acquire a target image; and a capture unit configured to intercept a preset region of the target image as an image to be identified.
  • the apparatus further includes: a frame sequence acquisition unit configured to acquire a frame sequence of the target video; and a selection unit configured to select a target frame in the frame sequence of the target video as an image to be identified.
  • the screenshot image recognition model is obtained by training in the following steps: obtaining a training sample set, where the training sample includes a sample image and label information used to characterize whether the sample image is a screenshot image; The sample image is used as input, and the label information corresponding to the input sample image is used as the desired output, and a screenshot image recognition model is trained.
  • the apparatus further includes: a recognition unit configured to indicate that the image to be recognized is not a screenshot image in response to the recognition result, to perform text recognition on the image to be recognized to obtain a recognition result; and a determination unit configured to determine the recognition result Whether to include a preset text; and a second deleting unit configured to delete the image to be recognized in response to determining that the recognition result includes the preset text.
  • a recognition unit configured to indicate that the image to be recognized is not a screenshot image in response to the recognition result, to perform text recognition on the image to be recognized to obtain a recognition result
  • a determination unit configured to determine the recognition result Whether to include a preset text
  • a second deleting unit configured to delete the image to be recognized in response to determining that the recognition result includes the preset text.
  • an embodiment of the present application provides an electronic device.
  • the electronic device includes: one or more processors; a storage device that stores one or more programs thereon; Or multiple processors execute, so that the above one or more processors implement the method as described in any implementation manner of the first aspect.
  • an embodiment of the present application provides a computer-readable medium on which a computer program is stored.
  • a computer program is stored.
  • the image recognition method and device provided in the embodiments of the present application recognize a to-be-recognized image by taking a screenshot image recognition model. If the image to be identified is a screenshot, delete it. Thus, the recognition of the image to be recognized and the deletion of the screenshot image are realized. Among them, because the screen capture image recognition model is used, compared with manual review, the efficiency of image review and recognition is improved.
  • FIG. 1 is an exemplary system architecture diagram to which an embodiment of the present application can be applied;
  • FIG. 1 is an exemplary system architecture diagram to which an embodiment of the present application can be applied;
  • FIG. 2 is a flowchart of an embodiment of an image recognition method according to the present application.
  • FIG. 3 is a schematic diagram of an application scenario of the image recognition method according to the present application.
  • FIG. 5 is a schematic structural diagram of an embodiment of an image recognition apparatus according to the present application.
  • FIG. 6 is a schematic structural diagram of a computer system suitable for implementing a server according to an embodiment of the present application.
  • FIG. 1 illustrates an exemplary system architecture 100 to which an image recognition method or an image recognition apparatus of an embodiment of the present application can be applied.
  • the system architecture 100 may include terminal devices 101, 102, and 103, a network 104, and a server 105.
  • the network 104 is a medium for providing a communication link between the terminal devices 101, 102, 103 and the server 105.
  • the network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, among others.
  • the user can use the terminal devices 101, 102, 103 to interact with the server 105 through the network 104 to receive or send messages and the like.
  • Various communication client applications can be installed on the terminal devices 101, 102, and 103, such as photographing applications, picture processing applications, instant messaging tools, email clients, social platform software, and the like.
  • the terminal devices 101, 102, and 103 may be hardware or software.
  • the terminal devices 101, 102, and 103 can be various electronic devices with support for storing and transmitting images, including, but not limited to, smart phones, tablet computers, laptop computers, and desktop computers.
  • the terminal devices 101, 102, and 103 are software, they can be installed in the electronic devices listed above. It can be implemented as multiple software or software modules (for example, to provide distributed services), or it can be implemented as a single software or software module. It is not specifically limited here.
  • the server 105 may be a server that provides various services, such as a background server that processes images stored in the terminal devices 101, 102, and 103.
  • the background server may process the received image (for example, identify whether it is a screenshot image), and perform corresponding processing according to the processing result (for example, the recognition result).
  • the image recognition method provided in the embodiment of the present application is generally executed by the server 105, and accordingly, the image recognition device is generally provided in the server 105.
  • the server may be hardware or software.
  • the server can be implemented as a distributed server cluster consisting of multiple servers or as a single server.
  • the server can be implemented as multiple software or software modules (for example, to provide distributed services), or it can be implemented as a single software or software module. It is not specifically limited here.
  • terminal devices, networks, and servers in FIG. 1 are merely exemplary. Depending on the implementation needs, there can be any number of terminal devices, networks, and servers.
  • the image recognition method includes the following steps:
  • Step 201 Obtain an image to be identified.
  • an execution subject of the image recognition method may obtain an image to be recognized from a terminal in a wired connection manner or a wireless connection manner.
  • the image to be identified may also be stored locally on the execution subject. At this time, the execution subject may directly obtain the image to be identified from the local.
  • the image to be identified may be any image that needs to be identified. In practice, the images to be identified can be specified by a technician or filtered according to certain conditions.
  • the method may include: acquiring a frame sequence of the target video; and selecting a target frame in the frame sequence of the target video as the image to be identified.
  • the target video can be any video.
  • the determination of the target video can be specified by a technician, or it can be filtered according to certain conditions.
  • the target frame may be at least one frame in the above-mentioned frame sequence.
  • the target frame can be specified by a technician, or it can be filtered according to certain conditions.
  • the condition may be: one frame is drawn at 2 second intervals.
  • step 202 an image to be identified is input to a pre-trained screen image recognition model, and a recognition result for characterizing whether the image to be identified is a screen image is obtained.
  • the above-mentioned execution subject may input an image to be recognized into a pre-trained screen image recognition model.
  • a recognition result for characterizing whether the image to be recognized is a screenshot image
  • the screenshot image may be an image recording content displayed on a screen of the electronic device.
  • the recognition results can take many forms. For example, you can use a number to indicate whether the image to be identified is a screenshot. Specifically, “1” may be used to indicate that the image to be identified is a screenshot image. Use "0" to indicate that the image to be identified is not a screenshot.
  • the recognition result may also be a value between 0 and 1, which is used to indicate the probability that the image to be recognized is a screenshot image.
  • the recognition results can also be text, characters, and so on.
  • the form of the recognition result is not limited.
  • the screenshot image recognition model is used to characterize the correspondence between the image to be recognized and the recognition result.
  • the screenshot image model may be a correspondence table storing a large number of images (including screenshot images or non-screenshot images) and recognition results corresponding to the images.
  • the correspondence relationship table may be generated based on statistics of a large number of images and recognition results.
  • the above-mentioned execution subject can match the image to be identified with a large number of images in the correspondence table.
  • a preset threshold for example, 95%) can be determined.
  • the recognition result corresponding to the determined image may be used as the recognition result of the image to be recognized.
  • the above screenshot image recognition model may also be a neural network.
  • the neural network abstracts the human brain neuron network from the perspective of information processing, establishes some simple model, and forms different networks according to different connection methods. Usually consists of a large number of nodes (or neurons) connected to each other, each node represents a specific output function, called the excitation function. The connection between each two nodes represents a weighted value for the signal passing through the connection, called a weight (also called a parameter), and the output of the network varies according to the connection mode, weight value and incentive function of the network.
  • a neural network usually includes multiple layers, and each layer includes multiple nodes. Generally, the nodes of the same layer can have the same weight, and the nodes of different layers can have different weights, so the parameters of multiple layers of the neural network can also be different.
  • Step 203 In response to the recognition result indicating that the image to be recognized is a screenshot image, the image to be recognized is deleted.
  • the execution subject may delete the image to be recognized.
  • the above-mentioned execution subject may also push information for indicating that the image to be identified is not a screenshot image.
  • text recognition is performed on the to-be-recognized image to obtain the recognition result; determining whether the recognition result includes a preset text; and responding to determining the recognition
  • the result contains preset text, and the image to be recognized is deleted.
  • the above-mentioned execution subject may perform character recognition on the image to be recognized through various methods to obtain the recognition result.
  • the recognition result may be related information of the text displayed in the image to be recognized.
  • OCR Optical Character Recognition
  • the execution body may determine whether the recognition result (for example, the obtained text) contains a preset text (for example, the name of an operator, etc.). If so, the execution subject may delete the image to be identified.
  • FIG. 3 is a schematic diagram of an application scenario of the image recognition method according to this embodiment.
  • the execution body of the image recognition method is the server 300.
  • the server 300 may first obtain an image 301 to be identified from a terminal. Then, the to-be-recognized image 301 is input to a pre-trained screen image recognition model to obtain a recognition result. If the recognition result indicates that the to-be-recognized image 301 is a screenshot image, the to-be-recognized image 301 is deleted.
  • the image recognition method provided by the above embodiments of the present application uses a screen capture image recognition model to identify an image to be recognized. If the image to be identified is a screenshot, delete it. Thus, the recognition of the image to be recognized and the deletion of the screenshot image are realized. Among them, because the screen capture image recognition model is used, compared with manual review, the efficiency of image review and recognition is improved.
  • FIG. 4 illustrates a process 400 of still another embodiment of the image recognition method.
  • the process 400 of the image recognition method includes the following steps:
  • Step 401 Obtain a target image.
  • the execution subject of the image recognition method may obtain the target image from the terminal in a wired connection or a wireless connection.
  • the target image can be any image.
  • the target image can be specified by a technician, or it can be filtered based on preset conditions.
  • the target image may be stored locally in the execution subject. At this time, the execution subject may also directly obtain the target image from the local.
  • Step 402 Capture a preset area of the target image as the image to be identified.
  • the execution subject may intercept a preset area of the target image as the image to be identified.
  • the preset area may be a part or all of the target image. For example, it can be the upper fifth area.
  • the above-mentioned execution subject may intercept the preset area of the target image in various ways. For example, through some screenshot applications or image processing applications.
  • Step 403 Acquire an image to be identified.
  • the execution subject may obtain the to-be-recognized image obtained in step 402. Because the image to be identified is obtained in step 402, it can generally be obtained directly from the local.
  • Step 404 Input the image to be identified into a pre-trained screen image recognition model, and obtain a recognition result used to characterize whether the image to be identified is a screen image.
  • the above-mentioned screenshot image recognition model may be a model obtained by training an image classification network, such as a Convolutional Neural Network (CNN), based on multiple training samples using a machine learning method.
  • CNN Convolutional Neural Network
  • the convolutional neural network can be a kind of feed-forward neural network, and its artificial neurons can respond to a part of the surrounding cells in the coverage area, and it has excellent performance for image processing.
  • a convolutional neural network may include a convolutional layer, a pooling layer, a depooling layer, and a deconvolution layer.
  • the convolution layer can be used to extract image features.
  • the pooling layer can be used to downsample the input information.
  • the depooling layer can be used to upsample the input information
  • the deconvolution layer is used to deconvolve the input information
  • the transposition of the convolution kernel of the convolution layer is used as the deconvolution layer.
  • the convolution kernel processes the input information.
  • the above screenshot image recognition model can be trained by the following steps:
  • the first step is to obtain a training sample set, where each training sample includes a sample image and annotation information used to characterize whether the sample image is a screenshot image.
  • each training sample includes a sample image and annotation information used to characterize whether the sample image is a screenshot image.
  • the label information may be in various forms.
  • the label information may be a numerical value. For example, "0" indicates that it is not a screenshot image, and "1" indicates that it is a screenshot image.
  • the label information may also be text, characters, and so on.
  • the sample image of the training samples in the training sample set is used as input, and the label information corresponding to the input sample image is used as the desired output, and a screenshot image recognition model is trained.
  • the sample images of the training samples can be input into the initial image classification network.
  • the initial image classification network may be various image classification networks. As an example, it may be a residual network (Residual Network, ResNet), VGG, or the like.
  • VGG is a classification model proposed by the Visual Geometry Group (VGG) of a university.
  • an initial value can be set for the initial image classification network. For example, it could be some different small random numbers. The "small random number” is used to ensure that the network does not enter a saturation state due to excessive weights, which causes training failure. "Different" is used to ensure that the network can learn normally. After that, the recognition result of the input sample image can be obtained.
  • the machine learning method is used to train the initial image classification network. Specifically, the difference between the recognition result and the label information calculated by using a preset loss function can be used first. Then, based on the obtained differences, the parameters of the initial image classification network can be adjusted, and if the preset training end condition is met, the training is ended, and the trained initial image classification network is used as a screenshot image recognition model.
  • the training end condition here includes but is not limited to at least one of the following: the training time exceeds a preset duration; the number of training times reaches a preset number of times; and the calculated difference is less than a preset difference threshold.
  • BP Back Propagation, Back Propagation
  • SGD Spochastic Gradient Descent, Stochastic Gradient Descent
  • the execution subject of the training step and the image recognition method may be the same or different. If they are the same, the execution subject can store the network structure and parameter values of the trained image recognition model locally after training to obtain the screen image recognition model. If they are different, after the training subject obtains a screen capture image recognition model from training, the network structure and parameter values of the model may be sent to the image recognition method execution subject.
  • Step 405 In response to the recognition result indicating that the image to be recognized is a screenshot image, the image to be recognized is deleted.
  • step 405 For the specific processing of step 405 and the technical effects brought by it, reference may be made to step 203 of the embodiment corresponding to FIG. 2, and details are not described herein again.
  • the process 400 of the image recognition method in this embodiment adds an image interception step, thereby reducing unnecessary interference information in the image and improving image recognition. Accuracy.
  • this application provides an embodiment of an image recognition device.
  • the device embodiment corresponds to the method embodiment shown in FIG. 2.
  • the device may specifically Used in various electronic equipment.
  • the image recognition device 500 in this embodiment includes an image acquisition unit 501, an image recognition unit 502, and a first deletion unit 503.
  • the image to-be-identified unit 501 is configured to acquire an image to be identified.
  • the image recognition unit 502 is configured to input a to-be-recognized image into a pre-trained screenshot recognition model to obtain a recognition result used to characterize whether the to-be-recognized image is a screenshot image, where the screenshot-recognition model is used to represent the Correspondence of recognition results.
  • the first deleting unit 503 is configured to delete the image to be identified in response to the recognition result indicating that the image to be identified is a screenshot image.
  • the apparatus 500 may further include: a push unit (not shown in the figure).
  • the pushing unit is configured to indicate that the image to be identified is not a screenshot image in response to the recognition result, and to push information for indicating that the image to be identified is not a screenshot image.
  • the apparatus 500 may further include: a target image acquisition unit (not shown in the figure) and a capture unit (not shown in the figure).
  • the target image acquisition unit is configured to acquire a target image.
  • the capturing unit is configured to capture a preset area of the target image as an image to be identified.
  • the apparatus 500 further includes: a frame sequence acquisition unit and a selection unit.
  • the frame sequence obtaining unit is configured to obtain a frame sequence of a target video.
  • the selection unit is configured to select a target frame in a frame sequence of the target video as an image to be identified.
  • the screenshot image recognition model is obtained by training in the following steps: obtaining a training sample set, where the training sample includes a sample image and annotation information used to characterize whether the sample image is a screenshot image; The sample images of the training samples in the training sample set are used as input, and the label information corresponding to the input sample images is used as the desired output, and a screenshot image recognition model is trained.
  • the apparatus 500 may further include: an identifying unit (not shown in the figure), a determining unit (not shown in the figure), and a second deleting unit (not shown in the figure) .
  • the recognition unit is configured to respond to the recognition result to indicate that the image to be recognized is not a screenshot image, and perform text recognition on the image to be recognized to obtain the recognition result;
  • the determination unit is configured to determine whether the recognition result includes a preset text;
  • the second deletion unit Configured to delete the image to be recognized in response to determining that the recognition result includes a preset text.
  • the above-mentioned image recognition unit 502 inputs the to-be-recognized image obtained by the to-be-recognized image acquisition unit 501 into a pre-trained screen image recognition model, and recognizes the to-be-recognized image. If the image to be identified is a screenshot image, it is deleted by the first deleting unit 503. Thus, the recognition of the image to be recognized and the deletion of the screenshot image are realized. Among them, because the screen capture image recognition model is used, compared with manual review, the efficiency of image review and recognition is improved.
  • FIG. 6 shows a schematic structural diagram of a computer system 600 suitable for implementing a server according to an embodiment of the present application.
  • the server shown in FIG. 6 is only an example, and should not impose any limitation on the functions and scope of use of the embodiments of the present application.
  • the computer system 600 includes a central processing unit (CPU) 601, which can be loaded into a random access memory (RAM) 603 from a program stored in a read-only memory (ROM) 602 or from a storage portion 608 Instead, perform various appropriate actions and processes.
  • RAM random access memory
  • ROM read-only memory
  • various programs and data required for the operation of the system 600 are also stored.
  • the CPU 601, the ROM 602, and the RAM 603 are connected to each other through a bus 604.
  • An input / output (I / O) interface 605 is also connected to the bus 604.
  • the following components are connected to the I / O interface 605: an input portion 606 including a keyboard, a mouse, and the like; an output portion 607 including a cathode ray tube (CRT), a liquid crystal display (LCD), and a speaker; a storage portion 608 including a hard disk and the like; a communication section 609 including a network interface card such as a LAN card, a modem, and the like.
  • the communication section 609 performs communication processing via a network such as the Internet.
  • the driver 610 is also connected to the I / O interface 605 as necessary.
  • a removable medium 611 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, etc., is installed on the drive 610 as needed, so that a computer program read therefrom is installed into the storage section 608 as needed.
  • the process described above with reference to the flowchart may be implemented as a computer software program.
  • embodiments of the present disclosure include a computer program product including a computer program carried on a computer-readable medium, the computer program containing program code for performing a method shown in a flowchart.
  • the computer program may be downloaded and installed from a network through the communication section 609, and / or installed from a removable medium 611.
  • CPU central processing unit
  • the computer-readable medium described in this application may be a computer-readable signal medium or a computer-readable storage medium or any combination of the two.
  • the computer-readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination thereof. More specific examples of computer-readable storage media may include, but are not limited to: electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable Programming read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the foregoing.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in combination with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal that is included in baseband or propagated as part of a carrier wave, and which carries computer-readable program code. Such a propagated data signal may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • the computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium, and the computer-readable medium may send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device.
  • Program code embodied on a computer-readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
  • Computer program code for performing the operations of this application may be written in one or more programming languages, or a combination thereof, including programming languages such as Java, Smalltalk, C ++, and also conventional Procedural programming language—such as "C" or a similar programming language.
  • the program code can be executed entirely on the user's computer, partly on the user's computer, as an independent software package, partly on the user's computer, partly on a remote computer, or entirely on a remote computer or server.
  • the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (such as through an Internet service provider) Internet connection).
  • LAN local area network
  • WAN wide area network
  • Internet service provider Internet service provider
  • each block in the flowchart or block diagram may represent a module, a program segment, or a part of code, which contains one or more functions to implement a specified logical function Executable instructions.
  • the functions labeled in the blocks may also occur in a different order than those labeled in the drawings. For example, two blocks represented one after the other may actually be executed substantially in parallel, and they may sometimes be executed in the reverse order, depending on the functions involved.
  • each block in the block diagrams and / or flowcharts, and combinations of blocks in the block diagrams and / or flowcharts can be implemented by a dedicated hardware-based system that performs the specified function or operation , Or it can be implemented with a combination of dedicated hardware and computer instructions.
  • the units described in the embodiments of the present application may be implemented by software or hardware.
  • the described unit may also be provided in a processor, for example, it may be described as: a processor includes an image acquisition unit to be identified, an image recognition unit, and an image first deletion unit.
  • a processor includes an image acquisition unit to be identified, an image recognition unit, and an image first deletion unit.
  • the names of these units do not constitute a limitation on the unit itself in some cases.
  • the image acquisition unit to be identified may also be described as a “unit to acquire an image to be identified”.
  • the present application also provides a computer-readable medium, which may be included in the server described in the above embodiments; or may exist alone without being assembled into the server.
  • the computer readable medium carries one or more programs, and when the one or more programs are executed by the server, the server: obtains an image to be identified; enters the image to be identified into a pre-trained screen image recognition model to obtain It is used to characterize the recognition result of whether the image to be recognized is a screenshot image, wherein the screenshot image recognition model is used to characterize the correspondence between the image to be recognized and the recognition result; in response to the recognition result, the image to be recognized is a screenshot image, and the image to be recognized is deleted .

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Image Analysis (AREA)

Abstract

La présente invention concerne un procédé et un appareil de reconnaissance d'image. Le procédé comprend les étapes consistant à : obtenir une image à reconnaître (201) ; entrer ladite image dans un modèle de reconnaissance d'image de capture d'écran pré-entraîné pour obtenir un résultat de reconnaissance permettant d'indiquer si ladite image est une image de capture d'écran (202) ; et supprimer ladite image en réponse au résultat de reconnaissance indiquant que ladite image est une image de capture d'écran (203). La reconnaissance d'une image à reconnaître et la suppression d'une image de capture d'écran sont mises en œuvre. En raison de l'utilisation d'un modèle de reconnaissance d'image de capture d'écran, l'efficacité de vérification et de reconnaissance d'une image est améliorée par rapport à une vérification manuelle.
PCT/CN2018/116335 2018-06-27 2018-11-20 Procédé et appareil de reconnaissance d'image WO2020000879A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810680031.4A CN109002842A (zh) 2018-06-27 2018-06-27 图像识别方法和装置
CN201810680031.4 2018-06-27

Publications (1)

Publication Number Publication Date
WO2020000879A1 true WO2020000879A1 (fr) 2020-01-02

Family

ID=64602070

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/116335 WO2020000879A1 (fr) 2018-06-27 2018-11-20 Procédé et appareil de reconnaissance d'image

Country Status (2)

Country Link
CN (1) CN109002842A (fr)
WO (1) WO2020000879A1 (fr)

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111291644A (zh) * 2020-01-20 2020-06-16 北京百度网讯科技有限公司 用于处理信息的方法和装置
CN111310693A (zh) * 2020-02-26 2020-06-19 腾讯科技(深圳)有限公司 图像中文本的智能标注方法、装置及存储介质
CN111353434A (zh) * 2020-02-28 2020-06-30 北京市商汤科技开发有限公司 信息识别方法及装置、系统、电子设备和存储介质
CN111353470A (zh) * 2020-03-13 2020-06-30 北京字节跳动网络技术有限公司 图像的处理方法、装置、可读介质和电子设备
CN111597966A (zh) * 2020-05-13 2020-08-28 北京达佳互联信息技术有限公司 一种表情图像识别方法、装置及系统
CN111767918A (zh) * 2020-02-21 2020-10-13 北京沃东天骏信息技术有限公司 一种图片识别方法和装置
CN111797645A (zh) * 2020-07-08 2020-10-20 北京京东振世信息技术有限公司 用于识别条形码的方法和装置
CN111815505A (zh) * 2020-07-14 2020-10-23 北京字节跳动网络技术有限公司 用于处理图像的方法、装置、设备和计算机可读介质
CN111860284A (zh) * 2020-07-15 2020-10-30 上海钧正网络科技有限公司 一种换电柜的安全管理方法、装置、介质及服务器
CN111914822A (zh) * 2020-07-23 2020-11-10 腾讯科技(深圳)有限公司 文本图像标注方法、装置、计算机可读存储介质及设备
CN111950591A (zh) * 2020-07-09 2020-11-17 中国科学院深圳先进技术研究院 模型训练方法、交互关系识别方法、装置及电子设备
CN112287757A (zh) * 2020-09-25 2021-01-29 北京百度网讯科技有限公司 水体识别方法、装置、电子设备及存储介质
CN112541543A (zh) * 2020-12-11 2021-03-23 深圳市优必选科技股份有限公司 图像识别方法、装置、终端设备及存储介质
CN112905843A (zh) * 2021-03-17 2021-06-04 北京文香信息技术有限公司 一种基于视频流的信息处理方法、装置以及存储介质
CN112989986A (zh) * 2021-03-09 2021-06-18 北京京东乾石科技有限公司 用于识别人群行为的方法、装置、设备以及存储介质
CN113221920A (zh) * 2021-05-20 2021-08-06 北京百度网讯科技有限公司 图像识别方法、装置、设备、存储介质以及计算机程序产品
CN113361404A (zh) * 2021-06-02 2021-09-07 北京百度网讯科技有限公司 用于识别文本的方法、装置、设备、存储介质及程序产品
CN113419915A (zh) * 2021-07-21 2021-09-21 北京百度网讯科技有限公司 云终端桌面静止确定方法和装置
CN113538450A (zh) * 2020-04-21 2021-10-22 百度在线网络技术(北京)有限公司 用于生成图像的方法及装置
CN113610968A (zh) * 2021-08-17 2021-11-05 北京京东乾石科技有限公司 目标检测模型的更新方法及装置
CN113643136A (zh) * 2021-09-01 2021-11-12 京东科技信息技术有限公司 信息处理方法、系统和装置

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111368902A (zh) * 2020-02-28 2020-07-03 北京三快在线科技有限公司 一种数据标注的方法及装置
CN113741680A (zh) * 2020-05-27 2021-12-03 北京字节跳动网络技术有限公司 信息交互方法和装置
CN113546398A (zh) * 2021-07-30 2021-10-26 重庆五诶科技有限公司 基于人工智能算法的棋牌游戏方法及系统
CN114693629A (zh) * 2022-03-25 2022-07-01 北京城市网邻信息技术有限公司 图像识别方法、装置、电子设备及可读介质

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1761204A (zh) * 2005-11-18 2006-04-19 郑州金惠计算机系统工程有限公司 在互联网上堵截色情图像与不良信息的系统
CN103605992A (zh) * 2013-11-28 2014-02-26 国家电网公司 一种电力内外网交互中的敏感图像识别方法
CN106446932A (zh) * 2016-08-30 2017-02-22 上海交通大学 基于机器学习与图片识别的可进化违禁图片批量处理方法

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104598902A (zh) * 2015-01-29 2015-05-06 百度在线网络技术(北京)有限公司 一种用于识别截图的方法、装置和浏览器
CN105654057A (zh) * 2015-12-31 2016-06-08 中国建设银行股份有限公司 基于图片内容的图片审核系统及图片审核方法
CN107133629B (zh) * 2016-02-29 2020-09-04 百度在线网络技术(北京)有限公司 图片分类方法、装置和移动终端
CN106599937A (zh) * 2016-12-29 2017-04-26 池州职业技术学院 一种不良图片过滤装置
CN108124191B (zh) * 2017-12-22 2019-07-12 北京百度网讯科技有限公司 一种视频审核方法、装置及服务器

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1761204A (zh) * 2005-11-18 2006-04-19 郑州金惠计算机系统工程有限公司 在互联网上堵截色情图像与不良信息的系统
CN103605992A (zh) * 2013-11-28 2014-02-26 国家电网公司 一种电力内外网交互中的敏感图像识别方法
CN106446932A (zh) * 2016-08-30 2017-02-22 上海交通大学 基于机器学习与图片识别的可进化违禁图片批量处理方法

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
HAN ZHONGHAI: "The design and Implementation of Photo-sensitive Recognition System Based on Prohibited Gallery", CHINA MASTER'S THESES FULL-TEXT DATABASE, 15 May 2012 (2012-05-15), pages 33 - 48, ISSN: 1674-0246 *

Cited By (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111291644B (zh) * 2020-01-20 2023-04-18 北京百度网讯科技有限公司 用于处理信息的方法和装置
CN111291644A (zh) * 2020-01-20 2020-06-16 北京百度网讯科技有限公司 用于处理信息的方法和装置
CN111767918A (zh) * 2020-02-21 2020-10-13 北京沃东天骏信息技术有限公司 一种图片识别方法和装置
CN111310693B (zh) * 2020-02-26 2023-08-29 腾讯科技(深圳)有限公司 图像中文本的智能标注方法、装置及存储介质
CN111310693A (zh) * 2020-02-26 2020-06-19 腾讯科技(深圳)有限公司 图像中文本的智能标注方法、装置及存储介质
CN111353434A (zh) * 2020-02-28 2020-06-30 北京市商汤科技开发有限公司 信息识别方法及装置、系统、电子设备和存储介质
CN111353470A (zh) * 2020-03-13 2020-06-30 北京字节跳动网络技术有限公司 图像的处理方法、装置、可读介质和电子设备
CN111353470B (zh) * 2020-03-13 2023-08-01 北京字节跳动网络技术有限公司 图像的处理方法、装置、可读介质和电子设备
CN113538450A (zh) * 2020-04-21 2021-10-22 百度在线网络技术(北京)有限公司 用于生成图像的方法及装置
CN113538450B (zh) * 2020-04-21 2023-07-21 百度在线网络技术(北京)有限公司 用于生成图像的方法及装置
US11810333B2 (en) 2020-04-21 2023-11-07 Baidu Online Network Technology (Beijing) Co., Ltd. Method and apparatus for generating image of webpage content
CN111597966A (zh) * 2020-05-13 2020-08-28 北京达佳互联信息技术有限公司 一种表情图像识别方法、装置及系统
CN111597966B (zh) * 2020-05-13 2023-10-10 北京达佳互联信息技术有限公司 一种表情图像识别方法、装置及系统
CN111797645A (zh) * 2020-07-08 2020-10-20 北京京东振世信息技术有限公司 用于识别条形码的方法和装置
CN111950591B (zh) * 2020-07-09 2023-09-01 中国科学院深圳先进技术研究院 模型训练方法、交互关系识别方法、装置及电子设备
CN111950591A (zh) * 2020-07-09 2020-11-17 中国科学院深圳先进技术研究院 模型训练方法、交互关系识别方法、装置及电子设备
CN111815505A (zh) * 2020-07-14 2020-10-23 北京字节跳动网络技术有限公司 用于处理图像的方法、装置、设备和计算机可读介质
CN111860284A (zh) * 2020-07-15 2020-10-30 上海钧正网络科技有限公司 一种换电柜的安全管理方法、装置、介质及服务器
CN111914822B (zh) * 2020-07-23 2023-11-17 腾讯科技(深圳)有限公司 文本图像标注方法、装置、计算机可读存储介质及设备
CN111914822A (zh) * 2020-07-23 2020-11-10 腾讯科技(深圳)有限公司 文本图像标注方法、装置、计算机可读存储介质及设备
CN112287757B (zh) * 2020-09-25 2024-04-26 北京百度网讯科技有限公司 水体识别方法、装置、电子设备及存储介质
CN112287757A (zh) * 2020-09-25 2021-01-29 北京百度网讯科技有限公司 水体识别方法、装置、电子设备及存储介质
CN112541543A (zh) * 2020-12-11 2021-03-23 深圳市优必选科技股份有限公司 图像识别方法、装置、终端设备及存储介质
CN112541543B (zh) * 2020-12-11 2023-11-24 深圳市优必选科技股份有限公司 图像识别方法、装置、终端设备及存储介质
CN112989986A (zh) * 2021-03-09 2021-06-18 北京京东乾石科技有限公司 用于识别人群行为的方法、装置、设备以及存储介质
CN112905843A (zh) * 2021-03-17 2021-06-04 北京文香信息技术有限公司 一种基于视频流的信息处理方法、装置以及存储介质
CN113221920A (zh) * 2021-05-20 2021-08-06 北京百度网讯科技有限公司 图像识别方法、装置、设备、存储介质以及计算机程序产品
CN113221920B (zh) * 2021-05-20 2024-01-12 北京百度网讯科技有限公司 图像识别方法、装置、设备、存储介质以及计算机程序产品
CN113361404A (zh) * 2021-06-02 2021-09-07 北京百度网讯科技有限公司 用于识别文本的方法、装置、设备、存储介质及程序产品
CN113419915A (zh) * 2021-07-21 2021-09-21 北京百度网讯科技有限公司 云终端桌面静止确定方法和装置
CN113610968A (zh) * 2021-08-17 2021-11-05 北京京东乾石科技有限公司 目标检测模型的更新方法及装置
CN113643136A (zh) * 2021-09-01 2021-11-12 京东科技信息技术有限公司 信息处理方法、系统和装置

Also Published As

Publication number Publication date
CN109002842A (zh) 2018-12-14

Similar Documents

Publication Publication Date Title
WO2020000879A1 (fr) Procédé et appareil de reconnaissance d'image
WO2019242222A1 (fr) Procédé et dispositif à utiliser lors de la génération d'informations
WO2020006963A1 (fr) Procédé et appareil de production de modèle de détection d'image
CN108427939B (zh) 模型生成方法和装置
WO2020000876A1 (fr) Procédé et dispositif de génération de modèle
WO2020006961A1 (fr) Procédé et dispositif d'extraction d'image
US11436863B2 (en) Method and apparatus for outputting data
WO2019237657A1 (fr) Procédé et dispositif de génération de modèle
CN109740018B (zh) 用于生成视频标签模型的方法和装置
CN109993150B (zh) 用于识别年龄的方法和装置
CN109034069B (zh) 用于生成信息的方法和装置
CN108235116B (zh) 特征传播方法和装置、电子设备和介质
JP7394809B2 (ja) ビデオを処理するための方法、装置、電子機器、媒体及びコンピュータプログラム
US11087140B2 (en) Information generating method and apparatus applied to terminal device
WO2020029608A1 (fr) Procédé et appareil pour détecter une bavure de feuille d'électrode
CN108235004B (zh) 视频播放性能测试方法、装置和系统
CN109583389B (zh) 绘本识别方法及装置
CN110070076B (zh) 用于选取训练用样本的方法和装置
CN110046571B (zh) 用于识别年龄的方法和装置
CN109816023B (zh) 用于生成图片标签模型的方法和装置
CN108399401B (zh) 用于检测人脸图像的方法和装置
CN113033677A (zh) 视频分类方法、装置、电子设备和存储介质
CN110008926B (zh) 用于识别年龄的方法和装置
CN111258414B (zh) 用于调整屏幕的方法和装置
CN108921138B (zh) 用于生成信息的方法和装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18923991

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 06/05/2021)

122 Ep: pct application non-entry in european phase

Ref document number: 18923991

Country of ref document: EP

Kind code of ref document: A1