CN107832765A - Picture recognition to including word content and picture material - Google Patents

Picture recognition to including word content and picture material Download PDF

Info

Publication number
CN107832765A
CN107832765A CN201710823997.4A CN201710823997A CN107832765A CN 107832765 A CN107832765 A CN 107832765A CN 201710823997 A CN201710823997 A CN 201710823997A CN 107832765 A CN107832765 A CN 107832765A
Authority
CN
China
Prior art keywords
picture
computer
image
character area
vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710823997.4A
Other languages
Chinese (zh)
Inventor
邓玥琳
高光明
刘辉
丁飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Baidu Online Network Technology Beijing Co Ltd
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201710823997.4A priority Critical patent/CN107832765A/en
Publication of CN107832765A publication Critical patent/CN107832765A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/153Segmentation of character regions using recognition of characters or words
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

It is an object of the invention to provide a kind of image identification method, device and computer equipment, computer-readable recording medium and computer program product.Character area and image-region in picture recognition device positioning picture;Corresponding term vector is extracted to the word content in the character area;To described image extracted region images characteristic vector;The term vector and described image characteristic vector are integrated, to determine the semanteme of the picture.Compared with prior art, the invention provides a kind of scheme that picture is identified automatically by computer equipment, so as to carry out content recognition and examination & verification before issue to the picture including word content and picture material.

Description

Picture recognition to including word content and picture material
Technical field
The present invention relates to picture recognition technical field, and in particular to a kind of picture to including word content and picture material Carry out the technology of semantics recognition.
Background technology
Current picture recognition technology is higher for the accuracy rate of the relatively simple picture of content, so as to which examination & verification is completed automatically Preferably.Or for example, it is word, trade mark for content, or being the picture of single theme, picture examination system can directly lead to OCR technique is crossed to identify the word or trade mark in picture, or by based on the Image Classfication Technology of deep learning in picture Element be identified, to judge whether it meets promulgated standard.
However, for the picture including word content and picture material simultaneously, existing picture recognition technology is more difficult to be applicable, Carried out more by manual examination and verification, this make it that the pressure of auditor is larger, and Review Cycle is also longer.
The content of the invention
It is an object of the invention to provide a kind of image identification method, device and computer equipment, computer-readable storage Medium and computer program product.
According to an aspect of the invention, there is provided a kind of image identification method, wherein, this method comprises the following steps:
Character area and image-region in-positioning picture;
- corresponding term vector is extracted to the word content in the character area;
- to described image extracted region images characteristic vector;
- integrated the term vector and described image characteristic vector, to determine the semanteme of the picture.
According to an aspect of the present invention, a kind of picture recognition device is additionally provided, wherein, the device includes:
For positioning the device of character area and image-region in picture;
For extracting the device of corresponding term vector to the word content in the character area;
For the device to described image extracted region images characteristic vector;
For the term vector and described image characteristic vector to be integrated, to determine the semantic dress of the picture Put.
According to an aspect of the present invention, additionally provide a kind of computer equipment, including memory, processor and be stored in On memory and the computer program that can run on a processor, wherein, it is real during computer program described in the computing device A kind of now image identification method according to an aspect of the present invention.
According to an aspect of the present invention, a kind of computer-readable recording medium is additionally provided, is stored thereon with computer Program, wherein, a kind of picture recognition according to an aspect of the present invention is realized when the computer program is executed by processor Method.
According to an aspect of the present invention, a kind of computer program product is additionally provided, when the computer program product A kind of image identification method according to an aspect of the present invention is realized when being performed by computer equipment.
Compared with prior art, the invention provides a kind of scheme that picture is identified automatically by computer equipment, So as to carry out content recognition and examination & verification before issue to the picture including word content and picture material.Specifically, the present invention is logical Cross and more fine-grained segmentation is carried out to the picture including word content and picture material, be accurately positioned character area and figure therein As region, and then the term vector and image feature vector that are extracted to each region are integrated, to identify the semanteme of picture.Knowing Do not go out after the semanteme of picture, the present invention can also determine whether it meets promulgated standard, to carry out issue examination & verification, example Such as judge whether picture to be released has vulgar content, when with vulgar content, then do not meet promulgated standard, belong to high wind Dangerous picture.
The present invention can be efficiently applied to the issue examination & verification of advertisement.Advertisement figure for including word content and picture material Piece, picture recognition system of the invention can be identified and issue risk identification to it, to accelerate the issue of advertising pictures speed Degree, while ensure that excessive risk picture is identified and filter without by improper issue, this also improves ad distribution user and net The experience of network user.
Brief description of the drawings
By reading the detailed description made to non-limiting example made with reference to the following drawings, of the invention is other Feature, objects and advantages will become more apparent upon:
Fig. 1 shows to be suitable to the block diagram for being used for realizing the exemplary computer system/server 12 of embodiment of the present invention;
Fig. 2 shows according to an embodiment of the invention a kind of to know the picture including word content and picture material Method for distinguishing flow chart;
Fig. 3 shows the schematic diagram for including word content and the picture of picture material according to an example of the present invention;
Fig. 4 shows according to an embodiment of the invention a kind of to know the picture including word content and picture material The schematic diagram of other device.
Same or analogous reference represents same or analogous part in accompanying drawing.
Embodiment
It should be mentioned that some exemplary embodiments are described as before exemplary embodiment is discussed in greater detail The processing described as flow chart or method.Although operations are described as the processing of order by flow chart, therein to be permitted Multioperation can be implemented concurrently, concomitantly or simultaneously.In addition, the order of operations can be rearranged.When it The processing can be terminated when operation is completed, it is also possible to the additional step being not included in accompanying drawing.The processing It can correspond to method, function, code, subroutine, subprogram etc..
Alleged within a context " computer equipment ", also referred to as " computer ", referring to can be by running preset program or referring to Order performs the intelligent electronic device of the predetermined process process such as numerical computations and/or logical calculated, its can include processor with Memory, the programmed instruction to be prestored in memory by computing device perform predetermined process process, or by ASIC, The hardware such as FPGA, DSP perform predetermined process process, or are realized by said two devices combination.Computer equipment includes but unlimited In server, personal computer (PC), notebook computer, tablet personal computer, smart mobile phone etc..
The computer equipment is for example including user equipment and the network equipment.Wherein, the user equipment includes but unlimited In personal computer (PC), notebook computer, mobile terminal etc., the mobile terminal includes but is not limited to smart mobile phone, PDA Deng;The network equipment includes but is not limited to single network server, the server group of multiple webservers composition or is based on The cloud being made up of a large amount of computers or the webserver of cloud computing (Cloud Computing), wherein, cloud computing is distributed One kind of calculating, a super virtual computer being made up of the computer collection of a group loose couplings.Wherein, the computer is set It is standby can isolated operation realize the present invention, also can access network and pass through the interactive operation with other computer equipments in network To realize the present invention.Wherein, the network residing for the computer equipment includes but is not limited to internet, wide area network, Metropolitan Area Network (MAN), office Domain net, VPN etc..
It should be noted that the user equipment, the network equipment and network etc. are only for example, other are existing or from now on may be used The computer equipment or network that can occur such as are applicable to the present invention, should also be included within the scope of the present invention, and to draw It is incorporated herein with mode.
The method (some of them illustrated by flow) discussed herein below can by hardware, software, firmware, in Between part, microcode, hardware description language or its any combination implement.When with software, firmware, middleware or microcode come real Shi Shi, to implement the program code of necessary task or code segment can be stored in machine or computer-readable medium (such as Storage medium) in.(one or more) processor can implement necessary task.
Concrete structure and function detail disclosed herein are only representational, and are for describing showing for the present invention The purpose of example property embodiment.But the present invention can be implemented by many alternative forms, and it is not interpreted as It is limited only by the embodiments set forth herein.
Although it should be appreciated that may have been used term " first ", " second " etc. herein to describe unit, But these units should not be limited by these terms.It is used for the purpose of using these terms by a unit and another unit Make a distinction.For example, in the case of the scope without departing substantially from exemplary embodiment, it is single that first module can be referred to as second Member, and similarly second unit can be referred to as first module.Term "and/or" used herein above include one of them or Any and all combination of more listed associated items.
Term used herein above is not intended to limit exemplary embodiment just for the sake of description specific embodiment.Unless Context clearly refers else, otherwise singulative used herein above "one", " one " also attempt to include plural number.Should also When understanding, term " comprising " and/or "comprising" used herein above provide stated feature, integer, step, operation, The presence of unit and/or component, and do not preclude the presence or addition of other one or more features, integer, step, operation, unit, Component and/or its combination.
It should further be mentioned that in some replaces realization modes, the function/action being previously mentioned can be according to different from attached The order indicated in figure occurs.For example, depending on involved function/action, the two width figures shown in succession actually may be used Substantially simultaneously to perform or can perform in a reverse order sometimes.
The present invention is described in further detail below in conjunction with the accompanying drawings.
Fig. 1 shows the block diagram suitable for being used for the exemplary computer system/server 12 for realizing embodiment of the present invention. The computer system/server 12 that Fig. 1 is shown is only an example, should not be to the function and use range of the embodiment of the present invention Bring any restrictions.
As shown in figure 1, computer system/server 12 is showed in the form of universal computing device.Computer system/service The component of device 12 can include but is not limited to:One or more processor or processing unit 16, system storage 28, connection The bus 18 of different system component (including system storage 28 and processing unit 16).
Bus 18 represents the one or more in a few class bus structures, including memory bus or Memory Controller, Peripheral bus, graphics acceleration port, processor or the local bus using any bus structures in a variety of bus structures.Lift For example, these architectures include but is not limited to industry standard architecture (ISA) bus, MCA (MAC) Bus, enhanced isa bus, VESA's (VESA) local bus and periphery component interconnection (PCI) bus.
Computer system/server 12 typically comprises various computing systems computer-readable recording medium.These media can be appointed What usable medium that can be accessed by computer system/server 12, including volatibility and non-volatile media, it is moveable and Immovable medium.
Memory 28 can include the computer system readable media of form of volatile memory, such as random access memory Device (RAM) 30 and/or cache memory 32.Computer system/server 12 may further include it is other it is removable/no Movably, volatile/non-volatile computer system storage medium.Only as an example, storage system 34 can be used for reading and writing Immovable, non-volatile magnetic media (Fig. 1 is not shown, commonly referred to as " hard disk drive ").Although not shown in Fig. 1, can To provide the disc driver being used for may move non-volatile magnetic disk (such as " floppy disk ") read-write, and to removable non-volatile Property CD (such as CD-ROM, DVD-ROM or other optical mediums) read-write CD drive.In these cases, it is each to drive Dynamic device can be connected by one or more data media interfaces with bus 18.Memory 28 can include at least one program Product, the program product have one group of (for example, at least one) program module, and these program modules are configured to perform the present invention The function of each embodiment.
Program/utility 40 with one group of (at least one) program module 42, such as memory 28 can be stored in In, such program module 42 includes --- but being not limited to --- operating system, one or more application program, other programs Module and routine data, the realization of network environment may be included in each or certain combination in these examples.Program mould Block 42 generally performs function and/or method in embodiment described in the invention.
Computer system/server 12 can also be (such as keyboard, sensing equipment, aobvious with one or more external equipments 14 Show device 24 etc.) communication, it can also enable a user to lead to the equipment that the computer system/server 12 interacts with one or more Letter, and/or any set with make it that the computer system/server 12 communicated with one or more of the other computing device Standby (such as network interface card, modem etc.) communicates.This communication can be carried out by input/output (I/O) interface 22.And And computer system/server 12 can also pass through network adapter 20 and one or more network (such as LAN (LAN), wide area network (WAN) and/or public network, such as internet) communication.As illustrated, network adapter 20 passes through bus 18 communicate with other modules of computer system/server 12.It should be understood that although not shown in Fig. 1, computer can be combined Systems/servers 12 use other hardware and/or software module, include but is not limited to:Microcode, device driver, at redundancy Manage unit, external disk drive array, RAID system, tape drive and data backup storage system etc..
Processing unit 16 is stored in the program in memory 28 by operation, so as to perform various function application and data Processing.
For example, the various functions for performing the present invention and the computer program of processing, processing are stored with memory 28 When unit 16 performs corresponding computer program, the present invention is realized to the identification including word content and the picture of picture material.
The present invention described in detail below realizes the specific work(to the identification including word content and the picture of picture material Energy/step.
Fig. 2 is shown according to one embodiment of present invention, wherein specifically illustrating one kind to including in word content and image The method flow diagram that the picture of appearance is identified.
The recognition methods is performed by picture recognition system.Picture recognition system typically lies in network side, such as arranges In one or more server.
As shown in Fig. 2 in step sl, character area and image-region in picture recognition system positioning picture;In step In rapid S2, picture recognition system extracts corresponding term vector to the word content in the character area;In step s3, picture Identifying system is to described image extracted region images characteristic vector;In step s 4, picture recognition system by the term vector with Described image characteristic vector is integrated, to determine the semanteme of the picture.
Specifically, in step sl, the character area and image-region in picture recognition system positioning picture.
As shown in figure 3, a picture to be released includes word content and picture material.The picture recognition system of the present invention It is intended to realize automatic identification to such picture including word content and picture material.
Picture recognition system can distinguish character area and figure therein to the picture including word content and picture material As region, and character area and image-region frame are elected respectively by candidate frame as shown in Figure 3.Wherein, character area Shown by solid box, image-region is shown by dotted line frame.
Here, the identification to the character area in picture and image-region can be by the good target detection model of training in advance Carry out.The typically for example various target detection models based on deep learning of target detection model, such as Faster-rcnn moulds Type, yolo models.
For example, collecting the picture through being labeled with character area and image-region in advance, and the picture input through mark is treated The target detection model of training, to carry out model training, so as to obtain the target detection model trained.
Specifically, will be simultaneously comprising word and other mesh by taking the classical model Faster-rcnn models of target detection as an example The picture of mark (as built) is trained after carrying out candidate frame mark and classification annotation, passes through RPN networks (Region Proposal Networks) extraction candidate frame, convolutional layer last layer by roipooling layers by feature in candidate frame A unified size is normalized to, then passes through the loss training networks classified and returned, realization pair respectively with full articulamentum In the position of candidate frame and classification (being that the candidate frame including word is still included such as the candidate frame of other targets of building) Prediction.
In step s 2, picture recognition system extracts corresponding term vector to the word content in character area.
Here, picture recognition system carries out character recognition to the character area in picture, to obtain identified character, enter And the character to being identified segments, to extract term vector therein.
Wherein, picture recognition system can use various existing character recognition technologies to carry out the character in character area Identification.Typically character recognition technologies such as OCR (Optical Character Recognition, optical character identification) skill Art.
After character is extracted, picture recognition system also needs further to extract term vector therein, such as passes through natural language The conventional word2vec methods of process field are sayed, extract corresponding participle, the character/word identified is as the defeated of term vector Enter.
In step s3, picture recognition system extracts image feature vector to image-region.
Carried here, picture recognition system can perform image feature vector by various image classification models to image-region Take.
For example, any CNN (Convolutional such as AlexNet, VGG or ResNet can be used in picture recognition system Neural Networks, convolutional neural networks) model realization image feature vector extraction, and can take last or in Between some full articulamentum data as image feature vector.
In step s 4, picture recognition system is integrated the term vector extracted with image feature vector, to determine The semanteme of picture.
Here, picture recognition system by semantics recognition model to the term vector extracted in step S2 with being extracted in step S3 Image feature vector integrated, to identify the semanteme of picture.
For example, DNN (Deep Neural Network, deep neural network), RNN can be used in picture recognition system (Recurrent Neural Network, recurrent neural network) or LSTM (Long Short-Term Memory, shot and long term note Recall unit) model etc., by the way that term vector and image feature vector directly such as are spliced into a characteristic vector, and then to this feature Vector carries out realizing the semantics recognition to the characteristic vector after the integration by Softmax graders.
Here, it should be noted that those skilled in the art will be understood that no matter enter herein using which kind of foregoing model Row vector is integrated and semantics recognition, extracts the selection of both vector field homoemorphism types before not influenceing respectively.
Preferably, picture recognition system can also further add audit function, so as to which picture recognition system upgrade is figure Piece auditing system.
Specifically, picture examination system judges whether the picture meets promulgated standard according to the semanteme of picture.
Here, the picture semantic that picture examination system identifies according to semantics recognition model, judges whether the picture accords with Promulgated standard is closed, such as is related to the pictures of the illegal contents such as vulgar, violence, reaction and does not meet promulgated standard, belongs to excessive risk Picture, then it can not pass through examination & verification.
Fig. 4 is shown according to one embodiment of present invention, wherein specifically illustrating one kind to including in word content and image The schematic diagram for the device that the picture of appearance is identified.
The identification device can typically be considered as a picture recognition system, and the picture recognition system typically lies in net Network side, such as it is arranged in one or more server.
Carried as shown in figure 4, picture recognition system includes regional positioning device 41, term vector extraction element 42, characteristics of image Take device 43 and semantic recognition device 44.
Wherein, regional positioning device 41 positions the character area and image-region in picture;Term vector extraction element 42 is right Word content in the character area extracts corresponding term vector;Image characteristics extraction device 43 is to described image extracted region Image feature vector;Semantic recognition device 44 is integrated the term vector and described image characteristic vector, with described in determination The semanteme of picture.
Specifically, regional positioning device 41 positions the character area and image-region in picture.
As shown in figure 3, a picture to be released includes word content and picture material.The picture recognition system of the present invention It is intended to realize automatic identification to such picture including word content and picture material.
Regional positioning device 41 picture including word content and picture material can be distinguished character area therein and Image-region, and respectively elected character area and image-region frame by candidate frame as shown in Figure 3.Wherein, literal field Domain is shown that image-region is shown by dotted line frame by solid box.
Here, regional positioning device 41 can call various target detection models to identify the character area and figure in picture As region.Identification to the character area in picture and image-region can be carried out by the good target detection model of training in advance. The typically for example various target detection models based on deep learning of target detection model, such as Faster-rcnn models, yolo Model.
For example, collecting the picture through being labeled with character area and image-region in advance, and the picture input through mark is treated The target detection model of training, to carry out model training, so as to obtain the target detection model trained.
Specifically, will be simultaneously comprising word and other mesh by taking the classical model Faster-rcnn models of target detection as an example The picture of mark (as built) is trained after carrying out candidate frame mark and classification annotation, passes through RPN networks (Region Proposal Networks) extraction candidate frame, convolutional layer last layer by roipooling layers by feature in candidate frame A unified size is normalized to, then passes through the loss training networks classified and returned, realization pair respectively with full articulamentum In the position of candidate frame and classification (being that the candidate frame including word is still included such as the candidate frame of other targets of building) Prediction.
Term vector extraction element 42 extracts corresponding term vector to the word content in character area.
Here, term vector extraction element 42 carries out character recognition to the character area in picture, to obtain identified word Symbol, and then the character to being identified segments, to extract term vector therein.
Wherein, term vector extraction element 42 can use various existing character recognition technologies to the character in character area It is identified.Typically character recognition technologies such as OCR (know by Optical Character Recognition, optical character Not) technology.
After character is extracted, term vector extraction element 42 also needs further to extract term vector therein, such as by certainly The conventional word2vec methods in right Language Processing field, extract corresponding participle, the character/word identified is as term vector Input.
Image characteristics extraction device 43 extracts image feature vector to image-region.
Here, image characteristics extraction device 43 can perform characteristics of image by various image classification models to image-region Vector extraction.
For example, any CNN such as AlexNet, VGG or ResNet can be used in image characteristics extraction device 43 The extraction of (Convolutional Neural Networks, convolutional neural networks) model realization image feature vector, and can be with The data of last or some middle full articulamentum are taken as image feature vector.
Semantic recognition device 44 is integrated the term vector extracted with image feature vector, to determine the language of picture Justice.
Here, term vector and figure that semantic recognition device 44 is extracted by semantics recognition model to term vector extraction element 42 As feature deriving means 43 extract image feature vector integrated, to identify the semanteme of picture.
For example, DNN (Deep Neural Network, deep neural network), RNN can be used in semantic recognition device 44 (Recurrent Neural Network, recurrent neural network) or LSTM (Long Short-Term Memory, shot and long term note Recall unit) model etc., by the way that term vector and image feature vector directly such as are spliced into a characteristic vector, and then to this feature Vector carries out realizing the semantics recognition to the characteristic vector after the integration by Softmax graders.
Here, it should be noted that those skilled in the art will be understood that no matter enter herein using which kind of foregoing model Row vector is integrated and semantics recognition, extracts the selection of both vector field homoemorphism types before not influenceing respectively.
Preferably, picture recognition system can also further add audit function and (be performed by examination & verification device, Fig. 4 does not show Go out), so as to which picture recognition system upgrade is picture examination system.
Specifically, picture examination system (examination & verification device) judges whether the picture meets issue mark according to the semanteme of picture It is accurate.
Here, the picture semantic that examination & verification device identifies according to semantics recognition model, judges whether the picture meets hair Cloth standard, such as be related to the pictures of the illegal contents such as vulgar, violence, reaction and do not meet promulgated standard, belong to excessive risk picture, Examination & verification can not then be passed through.
The present invention can use any combination of one or more computer-readable media.Computer-readable medium can be with It is computer-readable signal media or computer-readable recording medium.Computer-readable recording medium for example can be --- but Be not limited to --- electricity, magnetic, optical, electromagnetic, system, device or the device of infrared ray or semiconductor, or it is any more than combination. The more specifically example (non exhaustive list) of computer-readable recording medium includes:With being electrically connected for one or more wires Connect, portable computer diskette, hard disk, random access memory (RAM), read-only storage (ROM), erasable type may be programmed it is read-only Memory (EPROM or flash memory), optical fiber, portable compact disc read-only storage (CD-ROM), light storage device, magnetic memory Part or above-mentioned any appropriate combination.In this document, computer-readable recording medium can any be included or store The tangible medium of program, the program can be commanded the either device use or in connection of execution system, device.
Computer-readable signal media can include in a base band or as carrier wave a part propagation data-signal, Wherein carry computer-readable program code.The data-signal of this propagation can take various forms, including --- but It is not limited to --- electromagnetic signal, optical signal or above-mentioned any appropriate combination.Computer-readable signal media can also be Any computer-readable medium beyond computer-readable recording medium, the computer-readable medium can send, propagate or Transmit for by instruction execution system, device either device use or program in connection.
The program code included on computer-readable medium can be transmitted with any appropriate medium, including --- but it is unlimited In --- wireless, electric wire, optical cable, RF etc., or above-mentioned any appropriate combination.
It can be write with one or more programming languages or its combination for performing the computer that operates of the present invention Program code, described program design language include object oriented program language-such as Java, Smalltalk, C++, Also include conventional procedural programming language-such as " C " language or similar programming language.Program code can be with Fully perform, partly perform on the user computer on the user computer, the software kit independent as one performs, portion Divide and partly perform or performed completely on remote computer or server on the remote computer on the user computer. Be related in the situation of remote computer, remote computer can pass through the network of any kind --- including LAN (LAN) or Wide area network (WAN)-be connected to subscriber computer, or, it may be connected to outer computer (such as carried using Internet service Pass through Internet connection for business).
It should be noted that the present invention can be carried out in the assembly of software and/or software and hardware, for example, this hair Bright each device can using application specific integrated circuit (ASIC) or any other realized similar to hardware device.It is in addition, of the invention Some steps or function can employ hardware to realize, for example, coordinating as with processor so as to performing each step or function Circuit.
It is obvious to a person skilled in the art that the invention is not restricted to the details of above-mentioned one exemplary embodiment, Er Qie In the case of without departing substantially from spirit or essential attributes of the invention, the present invention can be realized in other specific forms.Therefore, no matter From the point of view of which point, embodiment all should be regarded as exemplary, and be nonrestrictive, the scope of the present invention is by appended power Profit requires rather than described above limits, it is intended that all in the implication and scope of the equivalency of claim by falling Change is included in the present invention.The multiple units or device stated in system claims can also be led to by a unit or device Software or hardware are crossed to realize.

Claims (11)

1. a kind of picture examination method, wherein, this method comprises the following steps:
Character area and image-region in-positioning picture;
- corresponding term vector is extracted to the word content in the character area;
- to described image extracted region images characteristic vector;
- integrated the term vector and described image characteristic vector, to determine the semanteme of the picture.
2. according to the method for claim 1, wherein, the positioning step specifically includes:
- pass through the character area and image-region in target detection Model Identification picture.
3. according to the method for claim 2, wherein, the target detection model is trained by following steps and obtained:
- collect the picture through being labeled with character area and image-region;
- picture through mark is inputted to target detection model to be trained, to carry out model training, so as to be trained The target detection model.
4. according to the method in any one of claims 1 to 3, wherein, the extraction step of the term vector specifically includes:
- character recognition is carried out to the character area, to obtain identified character;
- term vector is extracted from the character identified.
5. method according to any one of claim 1 to 4, wherein, the extraction step of described image characteristic vector is specific Including:
- by image classification model to described image extracted region described image characteristic vector.
6. method according to any one of claim 1 to 5, wherein, the integration step specifically includes:
- term vector and described image characteristic vector are integrated by semantics recognition model, to identify the picture It is semantic.
7. method according to any one of claim 1 to 6, wherein, this method is further comprising the steps of:
- according to the semanteme, judge whether the picture meets promulgated standard.
8. a kind of picture examination device, wherein, the device includes:
For positioning the device of character area and image-region in picture;
For extracting the device of corresponding term vector to the word content in the character area;
For the device to described image extracted region images characteristic vector;
For the term vector and described image characteristic vector to be integrated, to determine the semantic device of the picture.
9. a kind of computer equipment, including memory, processor and storage are on a memory and the meter that can run on a processor Calculation machine program, wherein, realized described in the computing device during computer program as any one of claim 1 to 7 Method.
10. a kind of computer-readable recording medium, is stored thereon with computer program, wherein, the computer program is processed The method as any one of claim 1 to 7 is realized when device performs.
11. a kind of computer program product, realize that right such as will when the computer program product is performed by computer equipment Seek the method any one of 1 to 7.
CN201710823997.4A 2017-09-13 2017-09-13 Picture recognition to including word content and picture material Pending CN107832765A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710823997.4A CN107832765A (en) 2017-09-13 2017-09-13 Picture recognition to including word content and picture material

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710823997.4A CN107832765A (en) 2017-09-13 2017-09-13 Picture recognition to including word content and picture material

Publications (1)

Publication Number Publication Date
CN107832765A true CN107832765A (en) 2018-03-23

Family

ID=61643840

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710823997.4A Pending CN107832765A (en) 2017-09-13 2017-09-13 Picture recognition to including word content and picture material

Country Status (1)

Country Link
CN (1) CN107832765A (en)

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108595544A (en) * 2018-04-09 2018-09-28 深源恒际科技有限公司 A kind of document picture classification method
CN108764374A (en) * 2018-06-11 2018-11-06 网易(杭州)网络有限公司 Image classification method, system, medium and electronic equipment
CN109241904A (en) * 2018-08-31 2019-01-18 平安科技(深圳)有限公司 Text region model training, character recognition method, device, equipment and medium
CN109712108A (en) * 2018-11-05 2019-05-03 杭州电子科技大学 It is a kind of that vision positioning method is directed to based on various distinctive candidate frame generation network
CN109754306A (en) * 2018-11-13 2019-05-14 北京码牛科技有限公司 Information processing method and device, electronic equipment and computer-readable medium
CN109858475A (en) * 2019-01-08 2019-06-07 平安科技(深圳)有限公司 Picture character localization method, device, medium and computer equipment
CN110222846A (en) * 2019-05-13 2019-09-10 中国科学院计算技术研究所 A kind of the information safety protection method and information security system of Internet terminal
CN110502984A (en) * 2019-07-11 2019-11-26 万翼科技有限公司 Drawing review method, apparatus, computer equipment and storage medium
CN110609877A (en) * 2018-06-14 2019-12-24 百度在线网络技术(北京)有限公司 Picture acquisition method, device, equipment and computer storage medium
CN110852894A (en) * 2019-11-04 2020-02-28 泰康保险集团股份有限公司 Insurance underwriting method and device, computer storage medium and electronic equipment
CN111008624A (en) * 2019-12-05 2020-04-14 嘉兴太美医疗科技有限公司 Optical character recognition method and method for generating training sample for optical character recognition
CN111079766A (en) * 2019-12-19 2020-04-28 青岛科技大学 Intelligent method for P & ID (peer-to-peer) graph
CN111242024A (en) * 2020-01-11 2020-06-05 北京中科辅龙科技股份有限公司 Method and system for recognizing legends and characters in drawings based on machine learning
CN111639639A (en) * 2019-03-01 2020-09-08 杭州海康威视数字技术股份有限公司 Method, device, equipment and storage medium for detecting text area
CN111832082A (en) * 2020-08-20 2020-10-27 支付宝(杭州)信息技术有限公司 Image-text integrity detection method and device
CN111881901A (en) * 2020-07-28 2020-11-03 平安科技(深圳)有限公司 Screenshot content detection method and device and computer-readable storage medium
CN111930964A (en) * 2020-09-17 2020-11-13 腾讯科技(深圳)有限公司 Content processing method, device, equipment and storage medium
CN112131376A (en) * 2019-06-24 2020-12-25 北京华雨天成文化传播有限公司 Auditing method and device
CN112200598A (en) * 2020-09-08 2021-01-08 北京数美时代科技有限公司 Picture advertisement identification method and device and computer equipment
CN112861656A (en) * 2021-01-21 2021-05-28 平安科技(深圳)有限公司 Trademark similarity detection method and device, electronic equipment and storage medium
CN113298697A (en) * 2021-03-19 2021-08-24 广州天越电子科技有限公司 Method for converting two-dimensional graphic elements into vector graphic elements based on artificial neural network
CN115019328A (en) * 2022-07-01 2022-09-06 支付宝(杭州)信息技术有限公司 Content risk identification method and system

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102663435A (en) * 2012-04-28 2012-09-12 南京邮电大学 Junk image filtering method based on semi-supervision
CN103207870A (en) * 2012-01-17 2013-07-17 华为技术有限公司 Method, server, device and system for photo sort management
CN105426356A (en) * 2015-10-29 2016-03-23 杭州九言科技股份有限公司 Target information identification method and apparatus
US20160203209A1 (en) * 2015-01-12 2016-07-14 Xerox Corporation Joint approach to feature and document labeling
CN106447366A (en) * 2015-08-07 2017-02-22 百度在线网络技术(北京)有限公司 Checking method of multimedia advertisement, and training method and apparatus of advertisement checking model
CN106777185A (en) * 2016-12-23 2017-05-31 浙江大学 A kind of across media Chinese herbal medicine image search methods based on deep learning
CN107122416A (en) * 2017-03-31 2017-09-01 北京大学 A kind of Chinese event abstracting method
CN107133221A (en) * 2017-06-09 2017-09-05 北京京东尚科信息技术有限公司 Signal auditing method, device, computer-readable medium and electronic equipment

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103207870A (en) * 2012-01-17 2013-07-17 华为技术有限公司 Method, server, device and system for photo sort management
CN102663435A (en) * 2012-04-28 2012-09-12 南京邮电大学 Junk image filtering method based on semi-supervision
US20160203209A1 (en) * 2015-01-12 2016-07-14 Xerox Corporation Joint approach to feature and document labeling
CN106447366A (en) * 2015-08-07 2017-02-22 百度在线网络技术(北京)有限公司 Checking method of multimedia advertisement, and training method and apparatus of advertisement checking model
CN105426356A (en) * 2015-10-29 2016-03-23 杭州九言科技股份有限公司 Target information identification method and apparatus
CN106777185A (en) * 2016-12-23 2017-05-31 浙江大学 A kind of across media Chinese herbal medicine image search methods based on deep learning
CN107122416A (en) * 2017-03-31 2017-09-01 北京大学 A kind of Chinese event abstracting method
CN107133221A (en) * 2017-06-09 2017-09-05 北京京东尚科信息技术有限公司 Signal auditing method, device, computer-readable medium and electronic equipment

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
乐毅 等: "《深度学习 Caffe之经典模型详解与实战》", 31 December 2016 *
公安部第三研究所: "《多摄像机协同关注目标检测跟踪技术》", 30 June 2017 *

Cited By (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108595544A (en) * 2018-04-09 2018-09-28 深源恒际科技有限公司 A kind of document picture classification method
CN108764374A (en) * 2018-06-11 2018-11-06 网易(杭州)网络有限公司 Image classification method, system, medium and electronic equipment
CN110609877A (en) * 2018-06-14 2019-12-24 百度在线网络技术(北京)有限公司 Picture acquisition method, device, equipment and computer storage medium
CN109241904A (en) * 2018-08-31 2019-01-18 平安科技(深圳)有限公司 Text region model training, character recognition method, device, equipment and medium
CN109241904B (en) * 2018-08-31 2023-10-20 平安科技(深圳)有限公司 Character recognition model training, character recognition method, device, equipment and medium
CN109712108A (en) * 2018-11-05 2019-05-03 杭州电子科技大学 It is a kind of that vision positioning method is directed to based on various distinctive candidate frame generation network
CN109754306A (en) * 2018-11-13 2019-05-14 北京码牛科技有限公司 Information processing method and device, electronic equipment and computer-readable medium
CN109858475A (en) * 2019-01-08 2019-06-07 平安科技(深圳)有限公司 Picture character localization method, device, medium and computer equipment
CN111639639B (en) * 2019-03-01 2023-05-02 杭州海康威视数字技术股份有限公司 Method, device, equipment and storage medium for detecting text area
CN111639639A (en) * 2019-03-01 2020-09-08 杭州海康威视数字技术股份有限公司 Method, device, equipment and storage medium for detecting text area
CN110222846A (en) * 2019-05-13 2019-09-10 中国科学院计算技术研究所 A kind of the information safety protection method and information security system of Internet terminal
CN110222846B (en) * 2019-05-13 2021-07-20 中国科学院计算技术研究所 Information security method and information security system for internet terminal
CN112131376A (en) * 2019-06-24 2020-12-25 北京华雨天成文化传播有限公司 Auditing method and device
CN110502984A (en) * 2019-07-11 2019-11-26 万翼科技有限公司 Drawing review method, apparatus, computer equipment and storage medium
CN110502984B (en) * 2019-07-11 2022-08-09 万翼科技有限公司 Drawing review method and device, computer equipment and storage medium
CN110852894A (en) * 2019-11-04 2020-02-28 泰康保险集团股份有限公司 Insurance underwriting method and device, computer storage medium and electronic equipment
CN111008624A (en) * 2019-12-05 2020-04-14 嘉兴太美医疗科技有限公司 Optical character recognition method and method for generating training sample for optical character recognition
CN111079766A (en) * 2019-12-19 2020-04-28 青岛科技大学 Intelligent method for P & ID (peer-to-peer) graph
CN111242024A (en) * 2020-01-11 2020-06-05 北京中科辅龙科技股份有限公司 Method and system for recognizing legends and characters in drawings based on machine learning
CN111881901A (en) * 2020-07-28 2020-11-03 平安科技(深圳)有限公司 Screenshot content detection method and device and computer-readable storage medium
CN111832082A (en) * 2020-08-20 2020-10-27 支付宝(杭州)信息技术有限公司 Image-text integrity detection method and device
CN111832082B (en) * 2020-08-20 2023-02-24 支付宝(杭州)信息技术有限公司 Image-text integrity detection method and device
CN112200598B (en) * 2020-09-08 2022-02-15 北京数美时代科技有限公司 Picture advertisement identification method and device and computer equipment
CN112200598A (en) * 2020-09-08 2021-01-08 北京数美时代科技有限公司 Picture advertisement identification method and device and computer equipment
CN111930964A (en) * 2020-09-17 2020-11-13 腾讯科技(深圳)有限公司 Content processing method, device, equipment and storage medium
WO2022156067A1 (en) * 2021-01-21 2022-07-28 平安科技(深圳)有限公司 Method and apparatus for detecting similarity between logos, electronic device, and storage medium
CN112861656A (en) * 2021-01-21 2021-05-28 平安科技(深圳)有限公司 Trademark similarity detection method and device, electronic equipment and storage medium
CN112861656B (en) * 2021-01-21 2024-05-14 平安科技(深圳)有限公司 Trademark similarity detection method and device, electronic equipment and storage medium
CN113298697A (en) * 2021-03-19 2021-08-24 广州天越电子科技有限公司 Method for converting two-dimensional graphic elements into vector graphic elements based on artificial neural network
CN113298697B (en) * 2021-03-19 2024-04-26 广州天越电子科技有限公司 Method for converting two-dimensional graphic elements into vector graphic elements based on artificial neural network
CN115019328A (en) * 2022-07-01 2022-09-06 支付宝(杭州)信息技术有限公司 Content risk identification method and system

Similar Documents

Publication Publication Date Title
CN107832765A (en) Picture recognition to including word content and picture material
CN111488826B (en) Text recognition method and device, electronic equipment and storage medium
Li et al. Box-supervised instance segmentation with level set evolution
CN108345587B (en) Method and system for detecting authenticity of comments
US10817707B2 (en) Attack sample generating method and apparatus, device and storage medium
CN107315779A (en) Log analysis method and system
CN108021806B (en) Malicious installation package identification method and device
CN108416003A (en) A kind of picture classification method and device, terminal, storage medium
US20190026367A1 (en) Navigating video scenes using cognitive insights
TW201947528A (en) Vehicle damage identification processing method, processing device, client and server
CN113780098B (en) Character recognition method, character recognition device, electronic equipment and storage medium
CN109599095A (en) A kind of mask method of voice data, device, equipment and computer storage medium
CN111046184A (en) Text risk identification method, device, server and storage medium
CN114429566A (en) Image semantic understanding method, device, equipment and storage medium
CN114359590A (en) NFT image work infringement detection method and device and computer storage medium
CN107729944B (en) Identification method and device of popular pictures, server and storage medium
CN111274390B (en) Emotion cause determining method and device based on dialogue data
CN112925905A (en) Method, apparatus, electronic device and storage medium for extracting video subtitles
US11321397B2 (en) Composition engine for analytical models
CN109669989A (en) Data verification method, system, equipment and medium
CN111738290B (en) Image detection method, model construction and training method, device, equipment and medium
CN111986259B (en) Training of pigment and text detection model, auditing method of video data and related device
CN117953426A (en) Foreign matter adhesion identification method and device for power transmission line and computer equipment
CN111259975B (en) Method and device for generating classifier and method and device for classifying text
CN112734772A (en) Image processing method, image processing apparatus, electronic device, and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180323