WO2022105027A1 - Image recognition method and system, electronic device, and storage medium - Google Patents

Image recognition method and system, electronic device, and storage medium Download PDF

Info

Publication number
WO2022105027A1
WO2022105027A1 PCT/CN2020/141047 CN2020141047W WO2022105027A1 WO 2022105027 A1 WO2022105027 A1 WO 2022105027A1 CN 2020141047 W CN2020141047 W CN 2020141047W WO 2022105027 A1 WO2022105027 A1 WO 2022105027A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
close
panoramic
panoramic image
preset
Prior art date
Application number
PCT/CN2020/141047
Other languages
French (fr)
Chinese (zh)
Inventor
高海超
Original Assignee
安徽鸿程光电有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 安徽鸿程光电有限公司 filed Critical 安徽鸿程光电有限公司
Publication of WO2022105027A1 publication Critical patent/WO2022105027A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/10Terrestrial scenes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation

Definitions

  • One or more embodiments of the present application relate to the technical field of image recognition, and in particular, to an image recognition method, system, electronic device, and storage medium.
  • image recognition technology has gradually become popular and permeated in every corner of life. Its purpose is mainly to more accurately and quickly determine the identity of the objects displayed in the target image or target area in various fields, such as face recognition, prohibited item recognition, and so on.
  • the purpose of one or more embodiments of the present application is to provide an image recognition method, system, electronic device and storage medium to solve the problem of low success rate of specific target recognition in panoramic images captured by panoramic cameras.
  • an image recognition method which includes:
  • the panoramic image includes several objects
  • the close-up image includes a preset object
  • the preset object is an object that satisfies a preset condition among the several objects
  • the identification result of the object in the corresponding area is determined according to the corresponding relationship between the corresponding area and the close-up image.
  • the comparing the close-up image and the panoramic image to determine a corresponding area in the panoramic image corresponding to the close-up image includes:
  • the comparison area of the panoramic image is used as the corresponding area.
  • the performing image recognition on the close-up image to determine the identification information of the preset object includes:
  • the feature identification is performed on the close-up image, the identity information of the preset object corresponding to the close-up image is determined, and the identity information is used as the identification information.
  • the preset object is specifically:
  • the distance to the acquisition position of the panoramic image is greater than a preset threshold.
  • the preset object is specifically:
  • the distance to the focus area of the panoramic image is greater than a preset threshold.
  • the preset object is specifically:
  • an object whose definition is smaller than a preset threshold in the panoramic image an object whose definition is smaller than a preset threshold in the panoramic image.
  • the panoramic image and the close-up image are acquired during the same period of time.
  • one or more embodiments of the present application also provide an image recognition system, the system includes: a panoramic camera, a close-up camera, and a processor; wherein,
  • the panoramic camera configured to capture a panoramic image, the panoramic image including several objects
  • the close-up camera is configured to collect a close-up image, the close-up image includes a preset object, and the preset object is an object that satisfies a preset condition among the several objects;
  • the processor is configured to perform the image recognition method as described in any of the above.
  • one or more embodiments of the present application also provide an electronic device, the electronic device includes a memory, a processor, and a computer program stored in the memory and executable on the processor, the processor executes the The image recognition method described in any one of the above is implemented in the program.
  • one or more embodiments of the present application also provide a non-transitory computer-readable storage medium, where the non-transitory computer-readable storage medium stores computer instructions, and the computer instructions are used to make the computer Perform the image recognition method as described in any of the above.
  • an image recognition method, system, electronic device, and storage medium provided by one or more embodiments of the present application include: acquiring a panoramic image and a close-up image, where the panoramic image includes several objects, and the close-up image includes A preset object, the preset object is an object that satisfies a preset condition among several objects; the close-up image and the panoramic image are compared to determine the corresponding area in the panoramic image corresponding to the close-up image, and the corresponding area includes the preset object; Image recognition is performed to determine the recognition information of the preset object; based on the recognition information, the recognition result of the object in the corresponding area is determined according to the corresponding relationship between the corresponding area and the close-up image.
  • One or more embodiments of the present application obtain a panoramic image and a close-up image of a specific object from a scene, and establish a connection between the close-up image and the panoramic image, so as to use the close-up image to identify the unclearly recognized part of the panoramic image, thereby solving the problem of solving the problem.
  • FIG. 1 is a schematic flowchart of an image recognition method proposed by one or more embodiments of the present application.
  • FIG. 2 is a schematic structural diagram of an electronic device according to one or more embodiments of the present application.
  • the recognition of objects in it is usually a panoramic image that covers the entire space by a panoramic camera, and the panoramic image is further processed. image recognition to achieve.
  • a specific person or object such as a person in the back row or far end
  • one or more embodiments of the present application propose an image recognition scheme, by separately obtaining a panoramic image and a close-up image of a specific object for a scene, and establishing a connection between the close-up image and the panoramic image, so as to utilize the close-up image.
  • Image recognition The unclear part of the panoramic image is recognized, and then the blurring of distant objects captured by the panoramic camera in a large scene is not conducive to image recognition, and the recognition error rate is high.
  • FIG. 1 it is a schematic flowchart of an image recognition method according to an embodiment of the present application, which specifically includes step 101 , step 102 , step 103 and step 104 .
  • Step 101 Obtain a panoramic image and a close-up image, where the panoramic image includes several objects, the close-up image includes a preset object, and the preset object is an object that satisfies a preset condition among the several objects.
  • the purpose of this step is to obtain a panoramic image and a close-up image of the preset object.
  • the panoramic image is the image including the whole area of the specific place, which can be obtained by a panoramic camera set in the specific place, or the images are stitched together after shooting the specific place with multiple cameras;
  • the specific scene in which things are identified such as: classroom, storage room, etc., which can be all or a specific part of the specific scene, take the classroom as an example, it can be the entire classroom, or the area other than the podium in the classroom, etc. Wait.
  • the objects are characters such as students in the classroom or objects such as paintings in the storage room.
  • a close-up image is a clear image of a preset object obtained by means of a close-up camera, etc.
  • a close-up image can be obtained by A separate image for each student at a distance, an image for all students at a distance, and so on.
  • the preset object targeted by the close-up image may be specifically adjusted according to the specific application scenario.
  • the preset object in one specific application scenario is specifically: an object whose distance to the acquisition position of the panoramic image is greater than a preset threshold among the several objects; the preset object in another specific application scenario The object, specifically: an object whose distance to the focal area of the panoramic image is greater than a preset threshold in the several objects; the preset object in another specific application scenario, specifically: the number of objects in the Objects whose sharpness is less than a preset threshold in the panoramic image, etc.
  • It may be a close-up image of a specific object designated in advance.
  • the obtained panoramic images and close-up images may be collected successively, respectively, or may be collected simultaneously.
  • the posture of a single target in the close-up image and the posture of the target in the panoramic image improve the degree of similarity between the close-up image and some areas in the panoramic image, so as to better perform the similarity comparison in the following steps, improve the Comparison success rate.
  • the panoramic image and the close-up image are acquired during the same period of time.
  • the same time period can be understood as the same time or a very small time interval, and the content of the two images obtained in the same time period will be almost the same, so that image recognition can be carried out very easily.
  • Step 102 compare the close-up image and the panoramic image, and determine a corresponding area in the panoramic image that corresponds to the close-up image, where the corresponding area includes the preset object.
  • the purpose of this step is to compare the panoramic image according to the close-up image, and determine the part corresponding to the close-up image in the panoramic image.
  • the close-up image is an image of a single object in the recognition scene or an image of the area where all the objects meet the conditions
  • the panoramic image is an image that reflects the whole picture of the recognized scene, and the close-up image must have corresponding parts in the panoramic image. Therefore, the relationship between the two can be established through image recognition and comparison technology, such as face recognition and feature extraction for close-up images, and image recognition in panoramic images with the extracted features to determine that the similarity is higher than a certain threshold. as the part corresponding to the close-up image.
  • the corresponding area is the area that can reflect the specific characteristics of the image, such as the face area in face recognition, the specific portrait area in painting recognition, the specific characteristics of contraband in contraband recognition, and so on.
  • the corresponding area may be one or more.
  • the corresponding area is a concentrated area containing specific features.
  • face recognition a frame-shaped area where faces are concentrated in a panoramic image.
  • the corresponding area is a single area where each specific feature is located.
  • the box-shaped area where each face is located in a panoramic image is a corresponding area.
  • the close-up image since the close-up image is for the preset objects that meet the conditions, it may include all the preset objects that meet the conditions through one image, or it may be obtained separately for each preset object that meets the conditions. Furthermore, the close-up image must also contain specific features of each preset object, so that by comparing and identifying with the panoramic image, a corresponding relationship between the close-up image and the corresponding area in the panoramic image can be established. Here, it can be that the whole image of the close-up image and the corresponding area in the panoramic image establish a corresponding relationship, or it can be that the comparison area in the close-up image and the corresponding comparison area in the panoramic image establish a corresponding relationship.
  • the corresponding relationship between the close-up image and the panoramic image can be the establishment of a relationship between the entire image of the close-up image and the corresponding area in the panoramic image, or It can be that the face area of each student in the close-up image establishes a corresponding relationship with the corresponding face area in the panoramic image, etc.; when the close-up image is only an image of a qualified student, the corresponding relationship established with the panoramic image can be It is the establishment of a correspondence between the face area of the student and the corresponding face area in the panoramic image, and so on.
  • the close-up image and the panoramic image can be taken at the same time or at a very short interval, the poses and features of the same person or object reflected by the two images are highly similar, and the results of the comparison of the two recognition results correspond to The accuracy rate is very high, far higher than the accuracy rate of identifying the two through images in the pre-stored identification library.
  • the pre-stored recognition library may store images such as student ID photos, while the close-up images and panoramic images are taken at the same time or at a short interval, and the students reflected in the two images All the features of the face are almost identical, and the accuracy of recognizing the panoramic image directly through the close-up image is much higher than the accuracy of recognizing the two images separately through the pre-stored recognition library.
  • the comparing the close-up image and the panoramic image to determine a corresponding area in the panoramic image corresponding to the close-up image includes: determining a comparison area of the close-up image and a ratio of the panoramic image. Compare the area; compare the similarity between the comparison area of the close-up image and the comparison area of the panoramic image; if the similarity comparison meets the set conditions, then use the comparison area of the panoramic image as the corresponding area.
  • the comparison area of the close-up image is the area that reflects the specific characteristics of the object in the close-up image, such as the face area of each student in the classroom scene, the specific characteristic area of each item in the storage room scene (such as the main body of the painting) part, etc.) etc.
  • the characteristic area (each face area) of each object in the close-up image is determined again. , and only identify each feature area in the close-up image and the panoramic image, without identifying the whole image.
  • the setting conditions may be conditions such as the similarity not less than 90%.
  • Step 103 Perform image recognition on the close-up image to determine the identification information of the preset object.
  • the purpose of this step is to identify the close-up image to determine the identification information of the preset object corresponding to the close-up image.
  • the identification information may be information identifying the preset object or the identity of the corresponding area, and at the same time, the identification information may only be quantitative statistical information, for example, if 5 preset objects are identified in the close-up image, the identification information can be identified as the number of people 5 people.
  • the image recognition in this step is similar to the image comparison in step 102, except that the comparison database is replaced.
  • a preset standard image library is used, which stores standard feature images of all objects.
  • identification information such as the identity information of the person.
  • the performing image recognition on the close-up image to determine the identification information of the preset object includes: performing feature recognition on the close-up image, and determining the preset image corresponding to the close-up image.
  • the identity information of the object is assumed, and the identity information is used as the identification information.
  • Step 104 based on the identification information, and according to the corresponding relationship between the corresponding area and the close-up image, determine the recognition result of the object in the corresponding area.
  • the purpose of this step is to correspond the identification information of the preset object in the close-up image to the corresponding object in the panoramic image according to the corresponding relationship, so as to determine the identification result of the object in the panoramic image.
  • the identification result is similar to the identification information, and both are information that can identify the identity of each object or each corresponding area.
  • the mapping relationship between the preset object in the close-up image and an object in the panoramic image can be directly established, and then according to the identification information of the preset object determined in step 103, it can be directly determined according to the mapping.
  • the recognition result of the object in the panoramic image may be the identity of the object in the panoramic image, such as the student's name, student number, ID number, and the like.
  • the identity information of specific objects or unclear objects in the panoramic image can be determined.
  • the clear objects in the panoramic image can be directly identified according to image recognition and other similar methods. recognition, and finally the recognition result of the entire panoramic image can be determined.
  • the recognition result may be information reflecting the identity or quantity of objects in the recognition scene, such as the identity information of each student in the classroom or the number of students in the classroom, the relevant identity information or quantity information of paintings in the storage room, and so on.
  • the recognition result can also be output to store, display or reprocess the recognition result.
  • the specific output mode of the recognition result can be flexibly selected.
  • the recognition result can be directly output on the display component (display, projector, etc.) of the current device in the form of display, so that the operator of the current device can The content of the recognition result can be seen directly on the display part.
  • the identification result may be sent to the system through any data communication method (wired connection, NFC, Bluetooth, wifi, cellular mobile network, etc.). other preset devices as receivers in the device, so that the preset device that has received the identification result can perform subsequent processing on it.
  • the preset device may be a preset server, and the server is generally set in the cloud, as a data processing and storage center, which can store and distribute the identification results; wherein, the recipient of the distribution is a terminal device, the The holder or operator of these terminal devices may be the current user, the relevant monitoring personnel who recognize the scene, the unit or individual related to the object in the recognized scene, and so on.
  • the identification result may be directly sent to a preset terminal device through any data communication method, and the terminal device may be one of the ones listed in the preceding paragraphs. one or more of.
  • a panoramic image is taken through a panoramic lens, and a close-up image is taken through a close-up lens.
  • the panoramic lens can scan 30 people, and the close-up lens can scan 9 people.
  • Rect1 rectangular area
  • the face set Rect2 includes: Rect21, Rect22, ..., Rect29, each face corresponds to a number, No1 to No9.
  • the numbers in Rect2 are in one-to-one correspondence with the numbers in Rect1.
  • Rect2 is a collection of Rects (rectangular areas) in the panoramic image, so that the face name name9 identified in the close-up shot corresponds to the panoramic shot.
  • the method includes: acquiring a panoramic image and a close-up image, the panoramic image includes several objects, the close-up image includes a preset object, and the preset object is one of the objects that satisfies the preset Conditional object; compare the close-up image and the panoramic image to determine the corresponding area in the panoramic image corresponding to the close-up image, and the corresponding area includes the preset object; perform image recognition on the close-up image to determine the identification information of the preset object; based on the recognition According to the corresponding relationship between the corresponding area and the close-up image, the recognition result of the object in the corresponding area is determined.
  • One or more embodiments of the present application obtain a panoramic image and a close-up image of a specific object from a scene, and establish a connection between the close-up image and the panoramic image, so as to use the close-up image to identify the unclearly recognized part of the panoramic image, thereby solving the problem of solving the problem.
  • the methods of one or more embodiments of the present application may be executed by a single device, such as a computer or a server.
  • the method in this embodiment can also be applied in a distributed scenario, and is completed by the cooperation of multiple devices.
  • one device among the multiple devices may only execute one or more steps in the method of one or more embodiments of the present application, and the multiple devices may perform operations on each other. interact to complete the described method.
  • one or more embodiments of the present application further provide an image recognition system, a panoramic camera, a close-up camera, and a processor; wherein,
  • the panoramic camera configured to capture a panoramic image, the panoramic image including several objects
  • the close-up camera is configured to collect a close-up image, the close-up image includes a preset object, and the preset object is an object that satisfies a preset condition among the several objects;
  • the processor is configured to execute an image recognition method as described in any one of the above embodiments.
  • one or more embodiments of the present application further provide an electronic device.
  • the electronic device includes a memory, a processor, and a computer program stored on the memory and running on the processor, and the processor implements the image recognition method described in any one of the above embodiments when the processor executes the program.
  • FIG. 2 shows a schematic diagram of a more specific hardware structure of an electronic device provided in this embodiment.
  • the device may include: a processor 210 , a memory 220 , an input/output interface 230 , a communication interface 240 and a bus 250 .
  • the processor 210 , the memory 220 , the input/output interface 230 and the communication interface 240 realize the communication connection among each other within the device through the bus 250 .
  • the processor 210 can be implemented by a general-purpose CPU (Central Processing Unit, central processing unit), a microprocessor, an application specific integrated circuit (Application Specific Integrated Circuit, ASIC), or one or more integrated circuits, etc. program to realize the technical solutions provided by the embodiments of the present application.
  • a general-purpose CPU Central Processing Unit, central processing unit
  • a microprocessor an application specific integrated circuit (Application Specific Integrated Circuit, ASIC)
  • ASIC Application Specific Integrated Circuit
  • the memory 220 can be implemented in the form of a ROM (Read Only Memory, read-only memory), a RAM (Random Access Memory, random access memory), a static storage device, a dynamic storage device, and the like.
  • the memory 220 may store an operating system and other application programs. When implementing the technical solutions provided by the embodiments of the present application through software or firmware, relevant program codes are stored in the memory 220 and invoked by the processor 210 for execution.
  • the input/output interface 230 is used for connecting input/output modules to realize information input and output.
  • the input/output module can be configured in the device as a component (not shown in the figure), or can be externally connected to the device to provide corresponding functions.
  • the input device may include a keyboard, a mouse, a touch screen, a microphone, various sensors, etc.
  • the output device may include a display, a speaker, a vibrator, an indicator light, and the like.
  • the communication interface 240 is used to connect a communication module (not shown in the figure), so as to realize the communication interaction between the device and other devices.
  • the communication module may implement communication through wired means (eg, USB, network cable, etc.), or may implement communication through wireless means (eg, mobile network, WIFI, Bluetooth, etc.).
  • the bus 250 includes a path to transfer information between the various components of the device (eg, the processor 210, the memory 220, the input/output interface 230, and the communication interface 240).
  • the bus 250 may include an Accelerated Graphics Port (AGP) or other graphics bus, an Extended Industry Standard Architecture (EISA) bus, a Front Side Bus (FSB), Hyper Transport (HT) interconnect, Industry Standard Architecture (ISA) bus, Infiniband interconnect, Low Pin Count (LPC) bus, Memory bus, Micro Channel Architecture (MCA) bus, Peripheral components Interconnect (PCI) bus, PCI-Express (PCI-X) bus, Serial Advanced Technology Attachment (SATA) bus, Video Electronics Standards Association Local (VLB) bus or other suitable bus or two or more of these The combination.
  • Bus 250 may include one or more buses, where appropriate. Although embodiments of this application describe and illustrate a particular bus, this application contemplates any suitable bus or interconnect.
  • the above-mentioned device only shows the processor 210, the memory 220, the input/output interface 230, the communication interface 240 and the bus 250, in the specific implementation process, the device may also include necessary components for normal operation. other components.
  • the above-mentioned device may only include components necessary to implement the solutions of the embodiments of the present application, rather than all the components shown in the figures.
  • one or more embodiments of the present application further provide a non-transitory computer-readable storage medium, where the non-transitory computer-readable storage medium stores computer instructions, and the computer instructions are used to cause the The computer executes an image recognition method described in any one of the embodiments above.
  • the computer readable medium of this embodiment includes both permanent and non-permanent, removable and non-removable media and can be implemented by any method or technology for information storage.
  • Information may be computer readable instructions, data structures, modules of programs, or other data.
  • Examples of computer storage media include, but are not limited to, phase-change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read only memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), Flash Memory or other memory technology, Compact Disc Read Only Memory (CD-ROM), Digital Versatile Disc (DVD) or other optical storage, Magnetic tape cartridges, magnetic tape disk storage or other magnetic storage devices or any other non-transmission medium that can be used to store information that can be accessed by a computing device.
  • PRAM phase-change memory
  • SRAM static random access memory
  • DRAM dynamic random access memory
  • RAM random access memory
  • ROM read only memory
  • EEPROM Electrically Erasable Programmable Read Only
  • the functional blocks shown in the above-described structural block diagrams may be implemented as hardware, software, firmware, or a combination thereof.
  • it can be, for example, an electronic circuit, an application specific integrated circuit (ASIC), suitable firmware, a plug-in, a function card, and the like.
  • ASIC application specific integrated circuit
  • elements of the present application are programs or code segments used to perform the required tasks.
  • the program or code segments may be stored in a machine-readable medium or transmitted over a transmission medium or communication link by a data signal carried in a carrier wave.
  • a "machine-readable medium” may include any medium that can store or transmit information.
  • machine-readable media examples include electronic circuits, semiconductor memory devices, ROM, flash memory, erasable ROM (EROM), floppy disks, CD-ROMs, optical disks, hard disks, fiber optic media, Radio Frequency (RF) links, etc. Wait.
  • the code segments may be downloaded via a computer network such as the Internet, an intranet, or the like.
  • processors may be, but are not limited to, general purpose processors, special purpose processors, application specific processors, or field programmable logic circuits. It will also be understood that each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can also be implemented by special purpose hardware for performing the specified functions or actions, or by special purpose hardware and/or A combination of computer instructions is implemented.
  • the figures provided may or may not be shown in connection with integrated circuit (IC) chips and other components.
  • IC integrated circuit
  • Well known power/ground connection IC
  • devices may be shown in block diagram form in order to avoid obscuring one or more embodiments of the present application, and this also takes into account the fact that details regarding the implementation of such block diagram devices are highly dependent on the implementation of the present application (ie, these details should be well within the purview of those skilled in the art) to which one or more embodiments are claimed.
  • DRAM dynamic RAM

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Studio Devices (AREA)
  • Image Analysis (AREA)

Abstract

An image recognition method and system, an electronic device, and a storage medium. The method comprises: acquiring a panoramic image and a close-up image, the panoramic image comprising several objects, and the close-up image comprising a preset object; comparing the close-up image and the panoramic image to determine a corresponding area, that corresponds to the close-up image, in the panoramic image; performing image recognition on the close-up image to determine recognition information of the preset object; and on the basis of the recognition information and according to the correspondence between the corresponding area and the close-up image, determining a recognition result of an object in the corresponding area. A panoramic image and a close-up image of a specific object are separately acquired by means of a scenario, and a relationship between the close-up image and the panoramic image is established, to thereby recognize, by using the close-up image, a part that is not clearly recognized in the panoramic image, and thus solving the problem in which a long-distance object image captured by means of a panoramic camera in a large scene is blurry, which is unfavorable for image recognition and leads to a high recognition error rate.

Description

图像识别方法、系统、电子设备及存储介质Image recognition method, system, electronic device and storage medium
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请要求享有于2020年11月19日提交的名称为“一种图像识别方法、系统、电子设备及存储介质”的中国专利申请202011307535.5的优先权,该申请的全部内容通过引用并入本文中。This application claims the priority of Chinese Patent Application No. 202011307535.5 filed on November 19, 2020, entitled "An Image Recognition Method, System, Electronic Device and Storage Medium", the entire contents of which are incorporated herein by reference .
技术领域technical field
本申请一个或多个实施例涉及图像识别技术领域,尤其涉及一种图像识别方法、系统、电子设备及存储介质。One or more embodiments of the present application relate to the technical field of image recognition, and in particular, to an image recognition method, system, electronic device, and storage medium.
背景技术Background technique
随着现代摄像技术的发展,图像识别技术已经逐渐普及并充斥在生活中的各个角落。其目的主要是为了在各领域中更为精确快捷的确定目标图像或目标区域所展示出的事物的身份,例如:人脸识别、违禁物品识别等等。With the development of modern camera technology, image recognition technology has gradually become popular and permeated in every corner of life. Its purpose is mainly to more accurately and quickly determine the identity of the objects displayed in the target image or target area in various fields, such as face recognition, prohibited item recognition, and so on.
但现有技术在图像识别领域还存在诸多问题。以人脸识别为例,在对一个大场景中的多个对象进行对象识别时,由于距离、对焦位置等特殊原因影响,利用全景摄像头拍摄的全景图像中特定的对象会比较模糊,不利于图像识别分析,从而会造成识别失败的问题。However, there are still many problems in the field of image recognition in the prior art. Taking face recognition as an example, when recognizing multiple objects in a large scene, due to special reasons such as distance and focus position, certain objects in the panoramic image captured by the panoramic camera will be blurred, which is not conducive to the image. Identification analysis, which will cause the problem of identification failure.
发明内容SUMMARY OF THE INVENTION
有鉴于此,本申请一个或多个实施例的目的在于提出一种图像识别方法、系统、电子设备及存储介质,以解决全景摄像头拍摄的全景图像中特定目标识别成功率不高的问题。In view of this, the purpose of one or more embodiments of the present application is to provide an image recognition method, system, electronic device and storage medium to solve the problem of low success rate of specific target recognition in panoramic images captured by panoramic cameras.
基于上述目的,本申请一个或多个实施例提供了一种图像识别方法,方法包括:Based on the above purpose, one or more embodiments of the present application provide an image recognition method, which includes:
获取全景图像及特写图像,所述全景图像包含若干对象,所述特写图像包含预设对象,所述预设对象为所述若干对象中满足预设条件的对象;acquiring a panoramic image and a close-up image, the panoramic image includes several objects, the close-up image includes a preset object, and the preset object is an object that satisfies a preset condition among the several objects;
对所述特写图像及所述全景图像进行比对,确定所述全景图像中与所述特写图像对应的对应区域,所述对应区域包括所述预设对象;comparing the close-up image and the panoramic image, and determining a corresponding area in the panoramic image corresponding to the close-up image, where the corresponding area includes the preset object;
对所述特写图像进行图像识别,确定所述预设对象的识别信息;Perform image recognition on the close-up image to determine the identification information of the preset object;
基于所述识别信息,根据所述对应区域与所述特写图像的对应关系,确定所述对应区域中对象的识别结果。Based on the identification information, the identification result of the object in the corresponding area is determined according to the corresponding relationship between the corresponding area and the close-up image.
在一些实施方式中,所述对所述特写图像及所述全景图像进行比对,确定所述全景图像中与所述特写图像对应的对应区域,包括:In some embodiments, the comparing the close-up image and the panoramic image to determine a corresponding area in the panoramic image corresponding to the close-up image includes:
确定所述特写图像的比对区域及所述全景图像的比对区域;determining the comparison area of the close-up image and the comparison area of the panoramic image;
对所述特写图像的比对区域与所述全景图像的比对区域进行相似度比对;performing similarity comparison between the comparison area of the close-up image and the comparison area of the panoramic image;
若相似度比对符合设定条件,则将所述全景图像的比对区域作为所述对应区域。If the similarity comparison meets the set condition, the comparison area of the panoramic image is used as the corresponding area.
在一些实施方式中,所述对所述特写图像进行图像识别,确定所述预设对象的识别信息,包括:In some embodiments, the performing image recognition on the close-up image to determine the identification information of the preset object includes:
对所述特写图像进行特征识别,确定所述特写图像对应的所述预设对象的身份信息,将所述身份信息作为所述识别信息。The feature identification is performed on the close-up image, the identity information of the preset object corresponding to the close-up image is determined, and the identity information is used as the identification information.
在一些实施方式中,所述预设对象,具体为:In some embodiments, the preset object is specifically:
所述若干对象中到所述全景图像的获取位置的距离大于预设阈值的对象。Among the several objects, the distance to the acquisition position of the panoramic image is greater than a preset threshold.
在一些实施方式中,所述预设对象,具体为:In some embodiments, the preset object is specifically:
所述若干对象中到所述全景图像的聚焦区域的距离大于预设阈值的对象。Among the several objects, the distance to the focus area of the panoramic image is greater than a preset threshold.
在一些实施方式中,所述预设对象,具体为:In some embodiments, the preset object is specifically:
所述若干对象中在所述全景图像中清晰度小于预设阈值的对象。Among the several objects, an object whose definition is smaller than a preset threshold in the panoramic image.
在一些实施方式中,所述全景图像和所述特写图像为在同一时段内采集得到的。In some embodiments, the panoramic image and the close-up image are acquired during the same period of time.
基于同一构思,本申请一个或多个实施例还提供了一种图像识别系统,系统包括:全景摄像头、特写摄像头和处理器;其中,Based on the same concept, one or more embodiments of the present application also provide an image recognition system, the system includes: a panoramic camera, a close-up camera, and a processor; wherein,
所述全景摄像头,被配置为采集全景图像,所述全景图像包含若干对象;the panoramic camera, configured to capture a panoramic image, the panoramic image including several objects;
所述特写摄像头,被配置为采集特写图像,所述特写图像包含预设对象,所述预设对象为所述若干对象中满足预设条件的对象;the close-up camera is configured to collect a close-up image, the close-up image includes a preset object, and the preset object is an object that satisfies a preset condition among the several objects;
所述处理器,被配置为执行如上任一项所述的图像识别方法。The processor is configured to perform the image recognition method as described in any of the above.
基于同一构思,本申请一个或多个实施例还提供了一种电子设备,电子设备包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述程序时实现如上任一项所述的图像识别方法。Based on the same concept, one or more embodiments of the present application also provide an electronic device, the electronic device includes a memory, a processor, and a computer program stored in the memory and executable on the processor, the processor executes the The image recognition method described in any one of the above is implemented in the program.
基于同一构思,本申请一个或多个实施例还提供了一种非暂态计算机可读存储介质,所述非暂态计算机可读存储介质存储计算机指令,所述计算机指令用于使所述计算机执行如上任一项所述的图像识别方法。Based on the same concept, one or more embodiments of the present application also provide a non-transitory computer-readable storage medium, where the non-transitory computer-readable storage medium stores computer instructions, and the computer instructions are used to make the computer Perform the image recognition method as described in any of the above.
从上面所述可以看出,本申请一个或多个实施例提供的一种图像识别方法、系统、电子设备及存储介质,包括:获取全景图像及特写图像,全景图像包含若干对象,特写图像包含预设对象,预设对象为若干对象中满足预设条件的对象;对特写图像及全景图像进行比对,确定全景图像中与特写图像对应的对应区域,对应区域包括预设对象;对特写图像进行图像识别,确定预设对象的识别信息;基于识别信息,根据对应区域与特写图像的对应关系,确定对应区域中对象的识别结果。本申请一个或多个实施例通过对场景分别获取全景图像及特定对象的特写图像,并建立特写图像与全景图像之间的联系,从而利用特写图像识别全景图像中识别不清楚的部分,进而解决了在大场景中通过全景摄像头拍摄远距离对象模糊,不利于图像识别,识别错误率高的问题。As can be seen from the above, an image recognition method, system, electronic device, and storage medium provided by one or more embodiments of the present application include: acquiring a panoramic image and a close-up image, where the panoramic image includes several objects, and the close-up image includes A preset object, the preset object is an object that satisfies a preset condition among several objects; the close-up image and the panoramic image are compared to determine the corresponding area in the panoramic image corresponding to the close-up image, and the corresponding area includes the preset object; Image recognition is performed to determine the recognition information of the preset object; based on the recognition information, the recognition result of the object in the corresponding area is determined according to the corresponding relationship between the corresponding area and the close-up image. One or more embodiments of the present application obtain a panoramic image and a close-up image of a specific object from a scene, and establish a connection between the close-up image and the panoramic image, so as to use the close-up image to identify the unclearly recognized part of the panoramic image, thereby solving the problem of solving the problem. This solves the problem of blurring distant objects captured by a panoramic camera in a large scene, which is not conducive to image recognition and has a high recognition error rate.
附图说明Description of drawings
为了更清楚地说明本申请一个或多个实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请一个或多个实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate one or more embodiments of the present application or the technical solutions in the prior art, the following briefly introduces the accompanying drawings that need to be used in the description of the embodiments or the prior art. Obviously, in the following description The accompanying drawings are only one or more embodiments of the present application, and for those of ordinary skill in the art, other drawings can also be obtained from these drawings without creative efforts.
图1为本申请一个或多个实施例提出的一种图像识别方法的流程示意图;1 is a schematic flowchart of an image recognition method proposed by one or more embodiments of the present application;
图2为本申请一个或多个实施例提出的电子设备的结构示意图。FIG. 2 is a schematic structural diagram of an electronic device according to one or more embodiments of the present application.
具体实施方式Detailed ways
为使本申请的目的、技术方案和优点更加清楚明白,以下结合具体实施例,并参照附图,对本申请进一步详细说明。In order to make the objectives, technical solutions and advantages of the present application more clearly understood, the present application will be further described in detail below with reference to specific embodiments and accompanying drawings.
需要说明的是,除非另外定义,本申请实施例使用的技术术语或者科学术语应当为本公开所属领域内具有一般技能的人士所理解的通常意义。本公开中使用的“第一”、“第二”以及类似的词语并不表示任何顺序、数量或者重要性,而只是用来区分不同的组成部分。“包括”或者“包含”等类似的词语意指出现该词前面的元件、物件或者方法步骤涵盖出现在该词后面列举的元件、物件或者方法步骤及其等同,而不排除其他元件、物件或者方法步骤。“连接”或者“相连”等类似的词语并非限定于物理的或者机械的连接,而是可以包括电性的连接,不管是直接的还是间接的。“上”、“下”、“左”、“右”等仅用于表示相对位置关系,当被描述对象的绝对位置改变后,则该相对位置关系也可能相应地改变。It should be noted that, unless otherwise defined, the technical terms or scientific terms used in the embodiments of the present application shall have the usual meanings understood by those with ordinary skill in the art to which this disclosure belongs. As used in this disclosure, "first," "second," and similar terms do not denote any order, quantity, or importance, but are merely used to distinguish the various components. "Comprising" or "comprising" and similar words mean that the elements, things, or method steps appearing before the word cover the elements, things, or method steps listed after the word, and their equivalents, but do not exclude other elements, things, or method steps. method steps. Words like "connected" or "connected" are not limited to physical or mechanical connections, but may include electrical connections, whether direct or indirect. "Up", "Down", "Left", "Right", etc. are only used to represent the relative positional relationship, and when the absolute position of the described object changes, the relative positional relationship may also change accordingly.
如背景技术部分所述,在一个大场景、大空间中,例如:大型教室、大型储物室等,对其中对象进行识别通常是通过全景摄像头拍摄囊括整个空间的全景图像,对全景图像进行进一步的图像识别来实现的。而对其中对象进行识别时,由于距离、对焦位置等特殊原因,特定的人或物(例如后排或远端的人)在全景图像中是较小或较模糊的,不利于图像的识别,很容易造成识别错误或识别失败的情况,非常影响图像识别的准确率。As described in the background art section, in a large scene and a large space, such as a large classroom, a large storage room, etc., the recognition of objects in it is usually a panoramic image that covers the entire space by a panoramic camera, and the panoramic image is further processed. image recognition to achieve. When recognizing the objects in it, due to special reasons such as distance and focus position, a specific person or object (such as a person in the back row or far end) is small or blurred in the panoramic image, which is not conducive to image recognition. It is easy to cause recognition errors or recognition failures, which greatly affects the accuracy of image recognition.
结合上述实际情况,本申请一个或多个实施例提出了一种图像识别方案,通过对场景分别获取全景图像及特定对象的特写图像,并建立特写图像与全景图像之间的联系,从而利用特写图像识别全景图像中识别不清楚的部分,进而实现了在大场景中通过全景摄像头拍摄远距离对象模糊,不利于图像识别,识别错误率高的问题。Combined with the above-mentioned actual situation, one or more embodiments of the present application propose an image recognition scheme, by separately obtaining a panoramic image and a close-up image of a specific object for a scene, and establishing a connection between the close-up image and the panoramic image, so as to utilize the close-up image. Image recognition The unclear part of the panoramic image is recognized, and then the blurring of distant objects captured by the panoramic camera in a large scene is not conducive to image recognition, and the recognition error rate is high.
参考图1所示,为本申请一个实施例的一种图像识别方法的流程示意图,具体包括步骤101、步骤102、步骤103和步骤104。Referring to FIG. 1 , it is a schematic flowchart of an image recognition method according to an embodiment of the present application, which specifically includes step 101 , step 102 , step 103 and step 104 .
步骤101,获取全景图像及特写图像,所述全景图像包含若干对象,所述特写图像包含预设对象,所述预设对象为所述若干对象中满足预设条件的对象。Step 101: Obtain a panoramic image and a close-up image, where the panoramic image includes several objects, the close-up image includes a preset object, and the preset object is an object that satisfies a preset condition among the several objects.
本步骤旨在,获取全景图像,及预设对象的特写图像。其中,全景图像即为包括特定场所的全部区域的图像,其可以通过设置于特定场所内的全景摄像头获取,或是通过多个摄像头拍摄特定场所后将图像拼接而成;特定场所即为需要对其中事物进行识别的具体场景,例如:教室、储物室等,其可以是具体场景的全部或特定部分,以教室为例,其可以是整个教室、也可以是教室中除讲台外的区域等等。对象即为教室中的学生等人物或储物室中的画作等物品。特写图像,为通过特写摄像头等方式获取的预设对象的清晰图像,其可以对每个预设对象进行单独拍摄获取,也可以对所有预设对象进行统一获取,以教室为例,特写图像可以是远距离的每个学生的单独图像,也可以是远距离的所有学生的图像等等。The purpose of this step is to obtain a panoramic image and a close-up image of the preset object. Among them, the panoramic image is the image including the whole area of the specific place, which can be obtained by a panoramic camera set in the specific place, or the images are stitched together after shooting the specific place with multiple cameras; The specific scene in which things are identified, such as: classroom, storage room, etc., which can be all or a specific part of the specific scene, take the classroom as an example, it can be the entire classroom, or the area other than the podium in the classroom, etc. Wait. The objects are characters such as students in the classroom or objects such as paintings in the storage room. A close-up image is a clear image of a preset object obtained by means of a close-up camera, etc. It can be obtained by shooting each preset object individually, or it can be obtained uniformly for all preset objects. Taking a classroom as an example, a close-up image can be obtained by A separate image for each student at a distance, an image for all students at a distance, and so on.
在一些应用场景中,特写图像所针对的预设对象可以根据具体的应用场景进行具体的调整。例如,一种具体应用场景中所述预设对象,具体为:所述若干对象中到所述全景图像的获取位置的距离大于预设阈值的对象;另一种具体应用场景中所述预设对象,具体为:所述若干对象中到所述全景图像的聚焦区域的距离大于预设阈值的对象;另一种具体应用场景中所述预设对象,具体为:所述若干对象中在所述全景图像中清晰度小于预设阈值的对象等等。其更可以是事先指定的特定对象的特写图像。In some application scenarios, the preset object targeted by the close-up image may be specifically adjusted according to the specific application scenario. For example, the preset object in one specific application scenario is specifically: an object whose distance to the acquisition position of the panoramic image is greater than a preset threshold among the several objects; the preset object in another specific application scenario The object, specifically: an object whose distance to the focal area of the panoramic image is greater than a preset threshold in the several objects; the preset object in another specific application scenario, specifically: the number of objects in the Objects whose sharpness is less than a preset threshold in the panoramic image, etc. It may be a close-up image of a specific object designated in advance.
在一些应用场景中,获取到的全景图像和特写图像可以是分别先后采集到的,也可以是同时采集到的。而为了使特写图像中单个目标的体态与 全景图像中该目标的体态尽可能一致,提高特写图像与全景图像中部分区域的相似程度,从而更好的进行之后步骤中的相似度比对,提高比对成功率。在一些实施例中,所述全景图像和所述特写图像为在同一时段内采集得到的。In some application scenarios, the obtained panoramic images and close-up images may be collected successively, respectively, or may be collected simultaneously. In order to make the posture of a single target in the close-up image and the posture of the target in the panoramic image as consistent as possible, improve the degree of similarity between the close-up image and some areas in the panoramic image, so as to better perform the similarity comparison in the following steps, improve the Comparison success rate. In some embodiments, the panoramic image and the close-up image are acquired during the same period of time.
其中,同一时段内既可以理解为同一时间或相差非常微小的时间间隔,同一时间段内获取到的两张图像,其内容会近乎相同,从而可以非常轻易的进行图像识别。Among them, the same time period can be understood as the same time or a very small time interval, and the content of the two images obtained in the same time period will be almost the same, so that image recognition can be carried out very easily.
步骤102,对所述特写图像及所述全景图像进行比对,确定所述全景图像中与所述特写图像对应的对应区域,所述对应区域包括所述预设对象。 Step 102 , compare the close-up image and the panoramic image, and determine a corresponding area in the panoramic image that corresponds to the close-up image, where the corresponding area includes the preset object.
本步骤旨在,根据特写图像对全景图像进行比对,确定出全景图像中与特写图像对应的部分。根据步骤101,特写图像为识别场景中单一对象的图像或全部满足条件对象所在区域的图像,而全景图像是反应识别场景全貌的图像,进而特写图像在全景图像中必定有相互对应的部分。从而可以通过图像识别比对技术建立两者之间的关系,例如对特写图像进行人脸识别并进行特征提取,并以提取到的特征在全景图像中进行图像识别,确定相似度高于一定阈值的部分作为与该特写图像对应的部分。The purpose of this step is to compare the panoramic image according to the close-up image, and determine the part corresponding to the close-up image in the panoramic image. According to step 101, the close-up image is an image of a single object in the recognition scene or an image of the area where all the objects meet the conditions, and the panoramic image is an image that reflects the whole picture of the recognized scene, and the close-up image must have corresponding parts in the panoramic image. Therefore, the relationship between the two can be established through image recognition and comparison technology, such as face recognition and feature extraction for close-up images, and image recognition in panoramic images with the extracted features to determine that the similarity is higher than a certain threshold. as the part corresponding to the close-up image.
对应区域即为能够反应图像特定特征的区域,例如人脸识别中的人脸区域、画作识别中特定画像区域、违禁品识别中违禁品的特定特征等等。在全景图像中,对应区域可以是一个也可以是多个,在一些应用场景中,对应区域为包含特定特征的集中区域,例如人脸识别中,在全景图像里人脸集中分布的框型区域;在另一些应用场景中,对应区域为每个特定特征所在的单个区域,例如人脸识别中,在全景图像里每个人脸所在的框型区域均为一个对应区域。之后,由于特写图像是针对符合条件的预设对象的,其可以是通过一张图像囊括所有符合条件的预设对象,也可以是对每个符合条件的预设对象进行单独获取。进而特写图像中也必定包含每个预设对象的特定特征,从而通过与全景图像进行比对识别,可以建立特写图像与全景图像中的对应区域的对应关系。这里,可以是特写图像整个图像与全景图像中的对应区域建立对应关系,也可以是特写图像中的比对区域 与全景图像中对应的比对区域之间建立对应关系。例如:在教室人脸识别场景中,当特写图像为一张包括全部符合条件学生的图像时,其与全景图像建立的对应关系可以是特写图像整个图像与全景图像中对应的区域建立关系,也可以是特写图像中每个学生的人脸区域与全景图像中对应的人脸区域建立对应关系等等;当特写图像仅为一个符合条件学生的图像时,其与全景图像建立的对应关系,可以是该学生的人脸区域与全景图像中对应的人脸区域建立对应关系等等。The corresponding area is the area that can reflect the specific characteristics of the image, such as the face area in face recognition, the specific portrait area in painting recognition, the specific characteristics of contraband in contraband recognition, and so on. In a panoramic image, the corresponding area may be one or more. In some application scenarios, the corresponding area is a concentrated area containing specific features. For example, in face recognition, a frame-shaped area where faces are concentrated in a panoramic image. ; In other application scenarios, the corresponding area is a single area where each specific feature is located. For example, in face recognition, the box-shaped area where each face is located in a panoramic image is a corresponding area. Afterwards, since the close-up image is for the preset objects that meet the conditions, it may include all the preset objects that meet the conditions through one image, or it may be obtained separately for each preset object that meets the conditions. Furthermore, the close-up image must also contain specific features of each preset object, so that by comparing and identifying with the panoramic image, a corresponding relationship between the close-up image and the corresponding area in the panoramic image can be established. Here, it can be that the whole image of the close-up image and the corresponding area in the panoramic image establish a corresponding relationship, or it can be that the comparison area in the close-up image and the corresponding comparison area in the panoramic image establish a corresponding relationship. For example: in the classroom face recognition scene, when the close-up image is an image including all eligible students, the corresponding relationship between the close-up image and the panoramic image can be the establishment of a relationship between the entire image of the close-up image and the corresponding area in the panoramic image, or It can be that the face area of each student in the close-up image establishes a corresponding relationship with the corresponding face area in the panoramic image, etc.; when the close-up image is only an image of a qualified student, the corresponding relationship established with the panoramic image can be It is the establishment of a correspondence between the face area of the student and the corresponding face area in the panoramic image, and so on.
在此,由于特写图像及全景图像可以是同一时间或间隔极短时间拍摄到的,两个图像反应的同一人或物的姿态和特征是高度相似的,进而两者比对的识别对应的结果准确率非常高,远高于通过预存的识别库中的图像对两者进行识别的准确率。以教室中对学生人脸识别为例,预存识别库中存储的可能都是学生的证件照等图像,而特写图像及全景图像当同一时间或间隔时间很短拍摄时,两图像中反应的学生人脸的全部特征近乎是等同的,进而直接通过特写图像对全景图像进行识别的准确率远高于通过预存识别库分别对两个图像进行识别的准确率。Here, since the close-up image and the panoramic image can be taken at the same time or at a very short interval, the poses and features of the same person or object reflected by the two images are highly similar, and the results of the comparison of the two recognition results correspond to The accuracy rate is very high, far higher than the accuracy rate of identifying the two through images in the pre-stored identification library. Taking the face recognition of students in the classroom as an example, the pre-stored recognition library may store images such as student ID photos, while the close-up images and panoramic images are taken at the same time or at a short interval, and the students reflected in the two images All the features of the face are almost identical, and the accuracy of recognizing the panoramic image directly through the close-up image is much higher than the accuracy of recognizing the two images separately through the pre-stored recognition library.
在一些应用场景中,为了准确确定全景图像中与特写图像对应的对应区域,仅对特定需要的区域进行识别,减少运算量。所述对所述特写图像及所述全景图像进行比对,确定所述全景图像中与所述特写图像对应的对应区域,包括:确定所述特写图像的比对区域及所述全景图像的比对区域;对所述特写图像的比对区域与所述全景图像的比对区域进行相似度比对;若相似度比对符合设定条件,则将所述全景图像的比对区域作为所述对应区域。In some application scenarios, in order to accurately determine the corresponding area in the panoramic image that corresponds to the close-up image, only the specific required area is identified to reduce the amount of computation. The comparing the close-up image and the panoramic image to determine a corresponding area in the panoramic image corresponding to the close-up image includes: determining a comparison area of the close-up image and a ratio of the panoramic image. Compare the area; compare the similarity between the comparison area of the close-up image and the comparison area of the panoramic image; if the similarity comparison meets the set conditions, then use the comparison area of the panoramic image as the corresponding area.
其中,特写图像的比对区域即为反应特写图像中对象具体特征的区域,例如教室场景下每个学生的人脸区域、储物室场景下每个物品的特定特征区域(如画作的画作主体部分等等)等。而为了减少运算量,不用对整个图像或整个区域进行识别,仅对特定的区域进行识别,在一些应用场景中,将特写图像中的每个对象的特征区域(每个人脸区域)再次确定出来,仅对特写图像与全景图像中的每个特征区域进行识别,不用对全图进行识别。Among them, the comparison area of the close-up image is the area that reflects the specific characteristics of the object in the close-up image, such as the face area of each student in the classroom scene, the specific characteristic area of each item in the storage room scene (such as the main body of the painting) part, etc.) etc. In order to reduce the amount of computation, instead of identifying the entire image or the entire area, only a specific area is identified. In some application scenarios, the characteristic area (each face area) of each object in the close-up image is determined again. , and only identify each feature area in the close-up image and the panoramic image, without identifying the whole image.
之后,设定条件可以为相似度不小于90%等条件。After that, the setting conditions may be conditions such as the similarity not less than 90%.
步骤103,对所述特写图像进行图像识别,确定所述预设对象的识别信息。Step 103: Perform image recognition on the close-up image to determine the identification information of the preset object.
本步骤旨在,对特写图像进行识别确定出特写图像对应的预设对象的识别信息。其中,识别信息可以为标识预设对象或对应区域身份的信息,同时,识别信息还可以仅是数量统计信息,例如特写图像中识别出来5个预设对象,则识别信息即可以为识别人数5人。The purpose of this step is to identify the close-up image to determine the identification information of the preset object corresponding to the close-up image. Wherein, the identification information may be information identifying the preset object or the identity of the corresponding area, and at the same time, the identification information may only be quantitative statistical information, for example, if 5 preset objects are identified in the close-up image, the identification information can be identified as the number of people 5 people.
此步骤的图像识别与步骤102中图像比对类似,其只是更换了对比的数据库,这里使用预设的标准图像库,里面存储着所有对象的标准特征图像。基于标准图像库对特写图像进行图像识别,确定出其中包含的预设对象的具体身份信息,例如通过人脸识别技术对特写图像进行人脸识别,确定出特写图像中单个或全部人脸所对应的人员的身份信息等识别信息。进而,在一些应用场景中,所述对所述特写图像进行图像识别,确定所述预设对象的识别信息,包括:对所述特写图像进行特征识别,确定所述特写图像对应的所述预设对象的身份信息,将所述身份信息作为所述识别信息。The image recognition in this step is similar to the image comparison in step 102, except that the comparison database is replaced. Here, a preset standard image library is used, which stores standard feature images of all objects. Perform image recognition on the close-up image based on the standard image library, and determine the specific identity information of the preset objects contained therein. identification information such as the identity information of the person. Further, in some application scenarios, the performing image recognition on the close-up image to determine the identification information of the preset object includes: performing feature recognition on the close-up image, and determining the preset image corresponding to the close-up image. The identity information of the object is assumed, and the identity information is used as the identification information.
步骤104,基于所述识别信息,根据所述对应区域与所述特写图像的对应关系,确定所述对应区域中对象的识别结果。 Step 104 , based on the identification information, and according to the corresponding relationship between the corresponding area and the close-up image, determine the recognition result of the object in the corresponding area.
本步骤旨在,根据对应关系将特写图像中预设对象的识别信息对应到全景图像中对应对象上,从而确定出全景图像中该对象的识别结果。识别结果与识别信息类似,均为可以标识每个对象或每个对应区域身份的信息。The purpose of this step is to correspond the identification information of the preset object in the close-up image to the corresponding object in the panoramic image according to the corresponding relationship, so as to determine the identification result of the object in the panoramic image. The identification result is similar to the identification information, and both are information that can identify the identity of each object or each corresponding area.
根据步骤102中确定的对应关系,可以直接建立特写图像中预设对象与全景图像中某个对象的映射关系,再根据步骤103中确定的预设对象的识别信息,根据映射就可以直接确定出全景图像中这个对象的识别结果,即可以是全景图像中该对象的身份,例如:学生的姓名、学号、身份证号等。According to the corresponding relationship determined in step 102, the mapping relationship between the preset object in the close-up image and an object in the panoramic image can be directly established, and then according to the identification information of the preset object determined in step 103, it can be directly determined according to the mapping. The recognition result of the object in the panoramic image may be the identity of the object in the panoramic image, such as the student's name, student number, ID number, and the like.
至此,就可以确定出全景图像中特定对象或不清楚对象的身份信息了,之后,对全景图像中即可清晰分辨的对象,可以根据图像识别等类似 方法直接对全景图像中清晰的对象进行身份识别,最终可以确定出整个全景图像的识别结果。其中,识别结果可以为反应识别场景中对象身份或数量的信息,例如教室中每个学生的身份信息或教室中学生的数量信息、储藏室中画作的相关身份信息或数量信息等等。At this point, the identity information of specific objects or unclear objects in the panoramic image can be determined. After that, for the objects that can be clearly distinguished in the panoramic image, the clear objects in the panoramic image can be directly identified according to image recognition and other similar methods. recognition, and finally the recognition result of the entire panoramic image can be determined. The recognition result may be information reflecting the identity or quantity of objects in the recognition scene, such as the identity information of each student in the classroom or the number of students in the classroom, the relevant identity information or quantity information of paintings in the storage room, and so on.
最后,还可以输出该识别结果,用以存储、展示或再加工识别结果。根据不同的应用场景和实施需要,具体的对于识别结果的输出方式可以灵活选择。Finally, the recognition result can also be output to store, display or reprocess the recognition result. According to different application scenarios and implementation needs, the specific output mode of the recognition result can be flexibly selected.
例如,对于本实施例的方法在单一设备上执行的应用场景,可以将识别结果直接在当前设备的显示部件(显示器、投影仪等)上以显示的方式输出,使得当前设备的操作者能够从显示部件上直接看到识别结果的内容。For example, for an application scenario in which the method of this embodiment is executed on a single device, the recognition result can be directly output on the display component (display, projector, etc.) of the current device in the form of display, so that the operator of the current device can The content of the recognition result can be seen directly on the display part.
又如,对于本实施例的方法在多个设备组成的系统上执行的应用场景,可以将识别结果通过任意的数据通信方式(有线连接、NFC、蓝牙、wifi、蜂窝移动网络等)发送至系统内的其他作为接收方的预设设备上,以使得接收到识别结果的预设设备可以对其进行后续处理。可选的,该预设设备可以是预设的服务器,服务器一般设置在云端,作为数据的处理和存储中心,其能够对识别结果进行存储和分发;其中,分发的接收方是终端设备,该些终端设备的持有者或操作者可以是当前用户、识别场景的相关监控人员、与识别场景中对象相关的单位或个人等等。For another example, for an application scenario in which the method of this embodiment is executed on a system composed of multiple devices, the identification result may be sent to the system through any data communication method (wired connection, NFC, Bluetooth, wifi, cellular mobile network, etc.). other preset devices as receivers in the device, so that the preset device that has received the identification result can perform subsequent processing on it. Optionally, the preset device may be a preset server, and the server is generally set in the cloud, as a data processing and storage center, which can store and distribute the identification results; wherein, the recipient of the distribution is a terminal device, the The holder or operator of these terminal devices may be the current user, the relevant monitoring personnel who recognize the scene, the unit or individual related to the object in the recognized scene, and so on.
再如,对于本实施例的方法在多个设备组成的系统上执行的应用场景时,可以将识别结果通过任意的数据通信方式直接发送至预设的终端设备,终端设备可以是前述段落列举中的一种或多种。For another example, for an application scenario where the method of this embodiment is executed on a system composed of multiple devices, the identification result may be directly sent to a preset terminal device through any data communication method, and the terminal device may be one of the ones listed in the preceding paragraphs. one or more of.
在具体应用场景中,以教室为识别场景为例,通过全景镜头拍摄全景图像,通过特写镜头拍摄特写图像,假设全部学生30个,全景镜头可扫描到30人,特写镜头可扫描到9人。In a specific application scenario, taking the classroom as an example of recognition, a panoramic image is taken through a panoramic lens, and a close-up image is taken through a close-up lens. Assuming that there are 30 students, the panoramic lens can scan 30 people, and the close-up lens can scan 9 people.
对特写镜头图片进行人脸检测,会得到检测人脸的Rect1(矩形区域)集合,如检测到9个,记为Rect11,Rect12,…,Rect19。Performing face detection on the close-up picture will get the Rect1 (rectangular area) set of detected faces. If 9 are detected, record them as Rect11, Rect12, ..., Rect19.
对Rect11至Rect19进行特征值提取,并将提取到的特征值连同编号 No1~No9注册到人脸识别模型中。对全景图片进行人脸识别,因为是同一时刻或相近时刻的图像,全景图像的人脸与特写图像中的人脸相似度匹配会非常高,提取相似度90%以上的匹配人脸作为全景人脸集合Rect2,包括:Rect21,Rect22,…,Rect29,每张人脸对应一个编号,No1~No9。其中Rect2中的编号和Rect1中的编号是一一对应的。Extract eigenvalues from Rect11 to Rect19, and register the extracted eigenvalues in the face recognition model together with the numbers No1 to No9. Perform face recognition on panoramic images, because they are images at the same time or at a similar time, the similarity between the face in the panoramic image and the face in the close-up image will be very high, and the matching face with a similarity of more than 90% is extracted as the panoramic person. The face set Rect2 includes: Rect21, Rect22, ..., Rect29, each face corresponds to a number, No1 to No9. The numbers in Rect2 are in one-to-one correspondence with the numbers in Rect1.
对人脸识别模型重新加载预设的30个学生的人脸库样本特征值,此特征值对应了人员姓名。对特写镜头人脸进行人脸识别,识别出人脸集合RECT3(Rect31,Rect32,…,Rect39)集合和对应的人员姓名(name1,name2,…,name9)。进而可以对应到Rect1集合中,而Rect11的编号No1对应Rect21的编号No1,这样通过编号可以找到Rect11对应Rect21,进而可以对应到Rect2集合中。Reload the preset 30 student face database sample eigenvalues to the face recognition model, and this eigenvalue corresponds to the person's name. Perform face recognition on the close-up face, and identify the face set RECT3 (Rect31, Rect32, ..., Rect39) set and the corresponding personnel names (name1, name2, ..., name9). Then it can correspond to the Rect1 set, and the number No1 of Rect11 corresponds to the number No1 of Rect21, so that Rect11 can be found corresponding to Rect21 through the number, and then it can correspond to the Rect2 set.
Rect2是全景图像中的Rect(矩形区域)集合,这样,就将特写镜头中识别出的人脸名称name9对应到全景镜头中。Rect2 is a collection of Rects (rectangular areas) in the panoramic image, so that the face name name9 identified in the close-up shot corresponds to the panoramic shot.
再对全景图像其他区域的人脸进行识别,整合成整个教室学生的识别结果。Then, the faces in other areas of the panoramic image are recognized and integrated into the recognition results of the students in the entire classroom.
通过应用本申请一个或多个实施例提供的一种图像识别方法,包括:获取全景图像及特写图像,全景图像包含若干对象,特写图像包含预设对象,预设对象为若干对象中满足预设条件的对象;对特写图像及全景图像进行比对,确定全景图像中与特写图像对应的对应区域,对应区域包括预设对象;对特写图像进行图像识别,确定预设对象的识别信息;基于识别信息,根据对应区域与特写图像的对应关系,确定对应区域中对象的识别结果。本申请一个或多个实施例通过对场景分别获取全景图像及特定对象的特写图像,并建立特写图像与全景图像之间的联系,从而利用特写图像识别全景图像中识别不清楚的部分,进而解决了在大场景中通过全景摄像头拍摄远距离对象模糊,不利于图像识别,识别错误率高的问题。By applying an image recognition method provided by one or more embodiments of the present application, the method includes: acquiring a panoramic image and a close-up image, the panoramic image includes several objects, the close-up image includes a preset object, and the preset object is one of the objects that satisfies the preset Conditional object; compare the close-up image and the panoramic image to determine the corresponding area in the panoramic image corresponding to the close-up image, and the corresponding area includes the preset object; perform image recognition on the close-up image to determine the identification information of the preset object; based on the recognition According to the corresponding relationship between the corresponding area and the close-up image, the recognition result of the object in the corresponding area is determined. One or more embodiments of the present application obtain a panoramic image and a close-up image of a specific object from a scene, and establish a connection between the close-up image and the panoramic image, so as to use the close-up image to identify the unclearly recognized part of the panoramic image, thereby solving the problem of solving the problem. This solves the problem of blurring distant objects captured by a panoramic camera in a large scene, which is not conducive to image recognition and has a high recognition error rate.
需要说明的是,本申请一个或多个实施例的方法可以由单个设备执行,例如一台计算机或服务器等。本实施例的方法也可以应用于分布式场景下,由多台设备相互配合来完成。在这种分布式场景的情况下,这多台 设备中的一台设备可以只执行本申请一个或多个实施例的方法中的某一个或多个步骤,这多台设备相互之间会进行交互以完成所述的方法。It should be noted that, the methods of one or more embodiments of the present application may be executed by a single device, such as a computer or a server. The method in this embodiment can also be applied in a distributed scenario, and is completed by the cooperation of multiple devices. In the case of such a distributed scenario, one device among the multiple devices may only execute one or more steps in the method of one or more embodiments of the present application, and the multiple devices may perform operations on each other. interact to complete the described method.
上述对本申请特定实施例进行了描述。其它实施例在所附权利要求书的范围内。在一些情况下,在权利要求书中记载的动作或步骤可以按照不同于实施例中的顺序来执行并且仍然可以实现期望的结果。另外,在附图中描绘的过程不一定要求示出的特定顺序或者连续顺序才能实现期望的结果。在某些实施方式中,多任务处理和并行处理也是可以的或者可能是有利的。The foregoing describes specific embodiments of the present application. Other embodiments are within the scope of the appended claims. In some cases, the actions or steps recited in the claims can be performed in an order different from that in the embodiments and still achieve desirable results. Additionally, the processes depicted in the figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In some embodiments, multitasking and parallel processing are also possible or may be advantageous.
基于同一发明构思,本申请一个或多个实施例还提供了一种图像识别系统,全景摄像头、特写摄像头和处理器;其中,Based on the same inventive concept, one or more embodiments of the present application further provide an image recognition system, a panoramic camera, a close-up camera, and a processor; wherein,
所述全景摄像头,被配置为采集全景图像,所述全景图像包含若干对象;the panoramic camera, configured to capture a panoramic image, the panoramic image including several objects;
所述特写摄像头,被配置为采集特写图像,所述特写图像包含预设对象,所述预设对象为所述若干对象中满足预设条件的对象;the close-up camera is configured to collect a close-up image, the close-up image includes a preset object, and the preset object is an object that satisfies a preset condition among the several objects;
所述处理器,被配置为执行如上任意一实施例所述的一种图像识别方法。The processor is configured to execute an image recognition method as described in any one of the above embodiments.
上述实施例的系统用于实现前述实施例中相应的方法,并且具有相应的方法实施例的有益效果,在此不再赘述。The systems in the foregoing embodiments are used to implement the corresponding methods in the foregoing embodiments, and have the beneficial effects of the corresponding method embodiments, which will not be repeated here.
基于同一发明构思,本申请一个或多个实施例还提供了一种电子设备。该电子设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述程序时实现如上任意一实施例所述的一种图像识别方法。Based on the same inventive concept, one or more embodiments of the present application further provide an electronic device. The electronic device includes a memory, a processor, and a computer program stored on the memory and running on the processor, and the processor implements the image recognition method described in any one of the above embodiments when the processor executes the program.
图2示出了本实施例所提供的一种更为具体的电子设备硬件结构示意图,该设备可以包括:处理器210、存储器220、输入/输出接口230、通信接口240和总线250。其中处理器210、存储器220、输入/输出接口230和通信接口240通过总线250实现彼此之间在设备内部的通信连接。FIG. 2 shows a schematic diagram of a more specific hardware structure of an electronic device provided in this embodiment. The device may include: a processor 210 , a memory 220 , an input/output interface 230 , a communication interface 240 and a bus 250 . The processor 210 , the memory 220 , the input/output interface 230 and the communication interface 240 realize the communication connection among each other within the device through the bus 250 .
处理器210可以采用通用的CPU(Central Processing Unit,中央处理 器)、微处理器、应用专用集成电路(Application Specific Integrated Circuit,ASIC)、或者一个或多个集成电路等方式实现,用于执行相关程序,以实现本申请实施例所提供的技术方案。The processor 210 can be implemented by a general-purpose CPU (Central Processing Unit, central processing unit), a microprocessor, an application specific integrated circuit (Application Specific Integrated Circuit, ASIC), or one or more integrated circuits, etc. program to realize the technical solutions provided by the embodiments of the present application.
存储器220可以采用ROM(Read Only Memory,只读存储器)、RAM(Random Access Memory,随机存取存储器)、静态存储设备,动态存储设备等形式实现。存储器220可以存储操作系统和其他应用程序,在通过软件或者固件来实现本申请实施例所提供的技术方案时,相关的程序代码保存在存储器220中,并由处理器210来调用执行。The memory 220 can be implemented in the form of a ROM (Read Only Memory, read-only memory), a RAM (Random Access Memory, random access memory), a static storage device, a dynamic storage device, and the like. The memory 220 may store an operating system and other application programs. When implementing the technical solutions provided by the embodiments of the present application through software or firmware, relevant program codes are stored in the memory 220 and invoked by the processor 210 for execution.
输入/输出接口230用于连接输入/输出模块,以实现信息输入及输出。输入/输出模块可以作为组件配置在设备中(图中未示出),也可以外接于设备以提供相应功能。其中输入设备可以包括键盘、鼠标、触摸屏、麦克风、各类传感器等,输出设备可以包括显示器、扬声器、振动器、指示灯等。The input/output interface 230 is used for connecting input/output modules to realize information input and output. The input/output module can be configured in the device as a component (not shown in the figure), or can be externally connected to the device to provide corresponding functions. The input device may include a keyboard, a mouse, a touch screen, a microphone, various sensors, etc., and the output device may include a display, a speaker, a vibrator, an indicator light, and the like.
通信接口240用于连接通信模块(图中未示出),以实现本设备与其他设备的通信交互。其中通信模块可以通过有线方式(例如USB、网线等)实现通信,也可以通过无线方式(例如移动网络、WIFI、蓝牙等)实现通信。The communication interface 240 is used to connect a communication module (not shown in the figure), so as to realize the communication interaction between the device and other devices. The communication module may implement communication through wired means (eg, USB, network cable, etc.), or may implement communication through wireless means (eg, mobile network, WIFI, Bluetooth, etc.).
总线250包括一通路,在设备的各个组件(例如处理器210、存储器220、输入/输出接口230和通信接口240)之间传输信息。举例来说而非限制,总线250可包括加速图形端口(Accelerated Graphics Port,AGP)或其他图形总线、增强工业标准架构(Extended Industry Standard Architecture,EISA)总线、前端总线(Front Side Bus,FSB)、超传输(Hyper Transport,HT)互连、工业标准架构(Industry Standard Architecture,ISA)总线、无限带宽互连、低引脚数(LPC)总线、存储器总线、微信道架构(MCA)总线、外围组件互连(PCI)总线、PCI-Express(PCI-X)总线、串行高级技术附件(SATA)总线、视频电子标准协会局部(VLB)总线或其他合适的总线或者两个或更多个以上这些的组合。在合适的情况下,总线250可包括一个或多个总线。尽管本申请实施例描述和示出了特定的总线,但本申请考虑任何合适的总线或互连。The bus 250 includes a path to transfer information between the various components of the device (eg, the processor 210, the memory 220, the input/output interface 230, and the communication interface 240). By way of example and not limitation, the bus 250 may include an Accelerated Graphics Port (AGP) or other graphics bus, an Extended Industry Standard Architecture (EISA) bus, a Front Side Bus (FSB), Hyper Transport (HT) interconnect, Industry Standard Architecture (ISA) bus, Infiniband interconnect, Low Pin Count (LPC) bus, Memory bus, Micro Channel Architecture (MCA) bus, Peripheral components Interconnect (PCI) bus, PCI-Express (PCI-X) bus, Serial Advanced Technology Attachment (SATA) bus, Video Electronics Standards Association Local (VLB) bus or other suitable bus or two or more of these The combination. Bus 250 may include one or more buses, where appropriate. Although embodiments of this application describe and illustrate a particular bus, this application contemplates any suitable bus or interconnect.
需要说明的是,尽管上述设备仅示出了处理器210、存储器220、输入/输出接口230、通信接口240以及总线250,但是在具体实施过程中,该设备还可以包括实现正常运行所必需的其他组件。此外,本领域的技术人员可以理解的是,上述设备中也可以仅包含实现本申请实施例方案所必需的组件,而不必包含图中所示的全部组件。It should be noted that, although the above-mentioned device only shows the processor 210, the memory 220, the input/output interface 230, the communication interface 240 and the bus 250, in the specific implementation process, the device may also include necessary components for normal operation. other components. In addition, those skilled in the art can understand that, the above-mentioned device may only include components necessary to implement the solutions of the embodiments of the present application, rather than all the components shown in the figures.
上述实施例的设备用于实现前述实施例中相应的方法,并且具有相应的方法实施例的有益效果,在此不再赘述。The devices in the foregoing embodiments are used to implement the corresponding methods in the foregoing embodiments, and have the beneficial effects of the corresponding method embodiments, which will not be repeated here.
基于同一发明构思,本申请一个或多个实施例还提供了一种非暂态计算机可读存储介质,所述非暂态计算机可读存储介质存储计算机指令,所述计算机指令用于使所述计算机执行日上任意一实施例所述的一种图像识别方法。Based on the same inventive concept, one or more embodiments of the present application further provide a non-transitory computer-readable storage medium, where the non-transitory computer-readable storage medium stores computer instructions, and the computer instructions are used to cause the The computer executes an image recognition method described in any one of the embodiments above.
本实施例的计算机可读介质包括永久性和非永久性、可移动和非可移动媒体可以由任何方法或技术来实现信息存储。信息可以是计算机可读指令、数据结构、程序的模块或其他数据。计算机的存储介质的例子包括,但不限于相变内存(PRAM)、静态随机存取存储器(SRAM)、动态随机存取存储器(DRAM)、其他类型的随机存取存储器(RAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、快闪记忆体或其他内存技术、只读光盘只读存储器(CD-ROM)、数字多功能光盘(DVD)或其他光学存储、磁盒式磁带,磁带磁盘存储或其他磁性存储设备或任何其他非传输介质,可用于存储可以被计算设备访问的信息。The computer readable medium of this embodiment includes both permanent and non-permanent, removable and non-removable media and can be implemented by any method or technology for information storage. Information may be computer readable instructions, data structures, modules of programs, or other data. Examples of computer storage media include, but are not limited to, phase-change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read only memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), Flash Memory or other memory technology, Compact Disc Read Only Memory (CD-ROM), Digital Versatile Disc (DVD) or other optical storage, Magnetic tape cartridges, magnetic tape disk storage or other magnetic storage devices or any other non-transmission medium that can be used to store information that can be accessed by a computing device.
以上所述的结构框图中所示的功能块可以实现为硬件、软件、固件或者它们的组合。当以硬件方式实现时,其可以例如是电子电路、专用集成电路(Application Specific Integrated Circuit,ASIC)、适当的固件、插件、功能卡等等。当以软件方式实现时,本申请的元素是被用于执行所需任务的程序或者代码段。程序或者代码段可以存储在机器可读介质中,或者通过载波中携带的数据信号在传输介质或者通信链路上传送。“机器可读介质”可以包括能够存储或传输信息的任何介质。机器可读介质的例子包括电子电路、半导体存储器设备、ROM、闪存、可擦除ROM (EROM)、软盘、CD-ROM、光盘、硬盘、光纤介质、射频(Radio Frequency,RF)链路,等等。代码段可以经由诸如因特网、内联网等的计算机网络被下载。The functional blocks shown in the above-described structural block diagrams may be implemented as hardware, software, firmware, or a combination thereof. When implemented in hardware, it can be, for example, an electronic circuit, an application specific integrated circuit (ASIC), suitable firmware, a plug-in, a function card, and the like. When implemented in software, elements of the present application are programs or code segments used to perform the required tasks. The program or code segments may be stored in a machine-readable medium or transmitted over a transmission medium or communication link by a data signal carried in a carrier wave. A "machine-readable medium" may include any medium that can store or transmit information. Examples of machine-readable media include electronic circuits, semiconductor memory devices, ROM, flash memory, erasable ROM (EROM), floppy disks, CD-ROMs, optical disks, hard disks, fiber optic media, Radio Frequency (RF) links, etc. Wait. The code segments may be downloaded via a computer network such as the Internet, an intranet, or the like.
还需要说明的是,本申请中提及的示例性实施例,基于一系列的步骤或者装置描述一些方法或系统。但是,本申请不局限于上述步骤的顺序,也就是说,可以按照实施例中提及的顺序执行步骤,也可以不同于实施例中的顺序,或者若干步骤同时执行。It should also be noted that the exemplary embodiments mentioned in this application describe some methods or systems based on a series of steps or devices. However, the present application is not limited to the order of the above steps, that is, the steps may be performed in the order mentioned in the embodiment, or may be different from the order in the embodiment, or several steps may be performed simultaneously.
上面参考根据本申请的实施例的方法、装置(系统)和计算机程序产品的流程图和/或框图描述了本申请的各方面。应当理解,流程图和/或框图中的每个方框以及流程图和/或框图中各方框的组合可以由计算机程序指令实现。这些计算机程序指令可被提供给通用计算机、专用计算机、或其它可编程数据处理装置的处理器,以产生一种机器,使得经由计算机或其它可编程数据处理装置的处理器执行的这些指令使能对流程图和/或框图的一个或多个方框中指定的功能/动作的实现。这种处理器可以是但不限于是通用处理器、专用处理器、特殊应用处理器或者现场可编程逻辑电路。还可理解,框图和/或流程图中的每个方框以及框图和/或流程图中的方框的组合,也可以由执行指定的功能或动作的专用硬件来实现,或可由专用硬件和计算机指令的组合来实现。Aspects of the present application are described above with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the present application. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine such that execution of the instructions via the processor of the computer or other programmable data processing apparatus enables the Implementation of the functions/acts specified in one or more blocks of the flowchart and/or block diagrams. Such processors may be, but are not limited to, general purpose processors, special purpose processors, application specific processors, or field programmable logic circuits. It will also be understood that each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can also be implemented by special purpose hardware for performing the specified functions or actions, or by special purpose hardware and/or A combination of computer instructions is implemented.
所属领域的普通技术人员应当理解:以上任何实施例的讨论仅为示例性的,并非旨在暗示本公开的范围(包括权利要求)被限于这些例子;在本公开的思路下,以上实施例或者不同实施例中的技术特征之间也可以进行组合,步骤可以以任意顺序实现,并存在如上所述的本申请一个或多个实施例的不同方面的许多其它变化,为了简明它们没有在细节中提供。It should be understood by those of ordinary skill in the art that the discussion of any of the above embodiments is only exemplary, and is not intended to imply that the scope of the present disclosure (including the claims) is limited to these examples; under the spirit of the present disclosure, the above embodiments or Technical features in different embodiments can also be combined, steps can be implemented in any order, and there are many other variations of the different aspects of one or more embodiments of the application as described above, which are not in detail for the sake of brevity supply.
另外,为简化说明和讨论,并且为了不会使本申请一个或多个实施例难以理解,在所提供的附图中可以示出或可以不示出与集成电路(IC)芯片和其它部件的公知的电源/接地连接。此外,可以以框图的形式示出设备,以便避免使本申请一个或多个实施例难以理解,并且这也考虑了以下事实,即关于这些框图设备的实施方式的细节是高度取决于将要实施本申请一个或多个实施例的平台的(即,这些细节应当完全处于本领域技术人 员的理解范围内)。在阐述了具体细节(例如,电路)以描述本公开的示例性实施例的情况下,对本领域技术人员来说显而易见的是,可以在没有这些具体细节的情况下或者这些具体细节有变化的情况下实施本申请一个或多个实施例。因此,这些描述应被认为是说明性的而不是限制性的。Additionally, in order to simplify illustration and discussion, and in order not to obscure one or more embodiments of the present application, the figures provided may or may not be shown in connection with integrated circuit (IC) chips and other components. Well known power/ground connection. Furthermore, devices may be shown in block diagram form in order to avoid obscuring one or more embodiments of the present application, and this also takes into account the fact that details regarding the implementation of such block diagram devices are highly dependent on the implementation of the present application (ie, these details should be well within the purview of those skilled in the art) to which one or more embodiments are claimed. Where specific details (eg, circuits) are set forth to describe exemplary embodiments of the present disclosure, it will be apparent to those skilled in the art that these specific details may be made without or with changes One or more embodiments of the present application are implemented below. Accordingly, these descriptions are to be regarded as illustrative rather than restrictive.
尽管已经结合了本公开的具体实施例对本公开进行了描述,但是根据前面的描述,这些实施例的很多替换、修改和变型对本领域普通技术人员来说将是显而易见的。例如,其它存储器架构(例如,动态RAM(DRAM))可以使用所讨论的实施例。Although the present disclosure has been described in conjunction with specific embodiments thereof, many alternatives, modifications, and variations to these embodiments will be apparent to those of ordinary skill in the art from the foregoing description. For example, other memory architectures (eg, dynamic RAM (DRAM)) may use the discussed embodiments.
本申请一个或多个实施例旨在涵盖落入所附权利要求的宽泛范围之内的所有这样的替换、修改和变型。因此,凡在本申请一个或多个实施例的精神和原则之内,所做的任何省略、修改、等同替换、改进等,均应包含在本公开的保护范围之内。The one or more embodiments of the present application are intended to cover all such alternatives, modifications and variations that fall within the broad scope of the appended claims. Therefore, any omission, modification, equivalent replacement, improvement, etc. made within the spirit and principle of one or more embodiments of the present application should be included within the protection scope of the present disclosure.

Claims (10)

  1. 一种图像识别方法,包括:An image recognition method, comprising:
    获取全景图像及特写图像,所述全景图像包含若干对象,所述特写图像包含预设对象;所述预设对象为所述若干对象中满足预设条件的对象;acquiring a panoramic image and a close-up image, the panoramic image includes several objects, and the close-up image includes a preset object; the preset object is an object that satisfies a preset condition among the several objects;
    对所述特写图像及所述全景图像进行比对,确定所述全景图像中与所述特写图像对应的对应区域,所述对应区域包括所述预设对象;comparing the close-up image and the panoramic image, and determining a corresponding area in the panoramic image corresponding to the close-up image, where the corresponding area includes the preset object;
    对所述特写图像进行图像识别,确定所述预设对象的识别信息;Perform image recognition on the close-up image to determine the identification information of the preset object;
    基于所述识别信息,根据所述对应区域与所述特写图像的对应关系,确定所述对应区域中对象的识别结果。Based on the identification information, the identification result of the object in the corresponding area is determined according to the corresponding relationship between the corresponding area and the close-up image.
  2. 根据权利要求1所述的方法,其中,所述对所述特写图像及所述全景图像进行比对,确定所述全景图像中与所述特写图像对应的对应区域,包括:The method according to claim 1, wherein the comparing the close-up image and the panoramic image to determine a corresponding area in the panoramic image corresponding to the close-up image comprises:
    确定所述特写图像的比对区域及所述全景图像的比对区域;determining the comparison area of the close-up image and the comparison area of the panoramic image;
    对所述特写图像的比对区域与所述全景图像的比对区域进行相似度比对;performing similarity comparison between the comparison area of the close-up image and the comparison area of the panoramic image;
    若相似度比对符合设定条件,则将所述全景图像的比对区域作为所述对应区域。If the similarity comparison meets the set condition, the comparison area of the panoramic image is used as the corresponding area.
  3. 根据权利要求1所述的方法,其中,所述对所述特写图像进行图像识别,确定所述预设对象的识别信息,包括:The method according to claim 1, wherein the performing image recognition on the close-up image to determine the identification information of the preset object comprises:
    对所述特写图像进行特征识别,确定所述特写图像对应的所述预设对象的身份信息,将所述身份信息作为所述识别信息。The feature identification is performed on the close-up image, the identity information of the preset object corresponding to the close-up image is determined, and the identity information is used as the identification information.
  4. 根据权利要求1所述的方法,其中,所述预设对象,具体为:The method according to claim 1, wherein the preset object is specifically:
    所述若干对象中到所述全景图像的获取位置的距离大于预设阈值的对象。Among the several objects, the distance to the acquisition position of the panoramic image is greater than a preset threshold.
  5. 根据权利要求1所述的方法,其中,所述预设对象,具体为:The method according to claim 1, wherein the preset object is specifically:
    所述若干对象中到所述全景图像的聚焦区域的距离大于预设阈值的对象。Among the several objects, the distance to the focus area of the panoramic image is greater than a preset threshold.
  6. 根据权利要求1所述的方法,其中,所述预设对象,具体为:The method according to claim 1, wherein the preset object is specifically:
    所述若干对象中在所述全景图像中清晰度小于预设阈值的对象。Among the several objects, an object whose definition is smaller than a preset threshold in the panoramic image.
  7. 根据权利要求1-6任一项所述的方法,其中,所述全景图像和所述特写图像为在同一时段内采集得到的。The method according to any one of claims 1-6, wherein the panoramic image and the close-up image are acquired within the same period of time.
  8. 一种图像识别系统,包括:全景摄像头、特写摄像头和处理器;其中,An image recognition system, comprising: a panoramic camera, a close-up camera and a processor; wherein,
    所述全景摄像头,被配置为采集全景图像,所述全景图像包含若干对象;the panoramic camera, configured to capture a panoramic image, the panoramic image including several objects;
    所述特写摄像头,被配置为采集特写图像,所述特写图像包含预设对象,所述预设对象为所述若干对象中满足预设条件的对象;the close-up camera is configured to collect a close-up image, the close-up image includes a preset object, and the preset object is an object that satisfies a preset condition among the several objects;
    所述处理器,被配置为执行如权利要求1至7任一项所述的方法。The processor, configured to perform the method of any one of claims 1 to 7.
  9. 一种电子设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述程序时实现如权利要求1至7任一项所述的方法。An electronic device includes a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the method according to any one of claims 1 to 7 when the processor executes the program.
  10. 一种非暂态计算机可读存储介质,所述非暂态计算机可读存储介质存储计算机指令,所述计算机指令用于使所述计算机执行权利要求1至7任一项所述的方法。A non-transitory computer-readable storage medium storing computer instructions for causing the computer to perform the method of any one of claims 1 to 7.
PCT/CN2020/141047 2020-11-19 2020-12-29 Image recognition method and system, electronic device, and storage medium WO2022105027A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011307535.5 2020-11-19
CN202011307535.5A CN112308018A (en) 2020-11-19 2020-11-19 Image identification method, system, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2022105027A1 true WO2022105027A1 (en) 2022-05-27

Family

ID=74336026

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/141047 WO2022105027A1 (en) 2020-11-19 2020-12-29 Image recognition method and system, electronic device, and storage medium

Country Status (2)

Country Link
CN (1) CN112308018A (en)
WO (1) WO2022105027A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114419698A (en) * 2021-12-27 2022-04-29 天津天地伟业信息系统集成有限公司 Control method of face recognition camera and computer readable storage medium
CN114630396A (en) * 2021-12-31 2022-06-14 厦门阳光恩耐照明有限公司 Intelligent lamp Bluetooth configuration method and system based on image recognition
CN115474076A (en) * 2022-08-15 2022-12-13 珠海视熙科技有限公司 Video stream image output method and device and camera equipment

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105554390A (en) * 2015-12-29 2016-05-04 广东欧珀移动通信有限公司 Shooting method and device, selfie stick, shooting system and mobile terminal
CN105812746A (en) * 2016-04-21 2016-07-27 北京格灵深瞳信息技术有限公司 Target detection method and system
CN109215055A (en) * 2017-06-30 2019-01-15 杭州海康威视数字技术股份有限公司 A kind of target's feature-extraction method, apparatus and application system
CN109299696A (en) * 2018-09-29 2019-02-01 成都臻识科技发展有限公司 A kind of method for detecting human face and device based on dual camera
CN110648299A (en) * 2018-06-26 2020-01-03 株式会社理光 Image processing method, image processing apparatus, and computer-readable storage medium
CN111292278A (en) * 2019-07-30 2020-06-16 展讯通信(上海)有限公司 Image fusion method and device, storage medium and terminal
CN111353361A (en) * 2019-08-19 2020-06-30 深圳市鸿合创新信息技术有限责任公司 Face recognition method and device and electronic equipment

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105554390A (en) * 2015-12-29 2016-05-04 广东欧珀移动通信有限公司 Shooting method and device, selfie stick, shooting system and mobile terminal
CN105812746A (en) * 2016-04-21 2016-07-27 北京格灵深瞳信息技术有限公司 Target detection method and system
CN109215055A (en) * 2017-06-30 2019-01-15 杭州海康威视数字技术股份有限公司 A kind of target's feature-extraction method, apparatus and application system
CN110648299A (en) * 2018-06-26 2020-01-03 株式会社理光 Image processing method, image processing apparatus, and computer-readable storage medium
CN109299696A (en) * 2018-09-29 2019-02-01 成都臻识科技发展有限公司 A kind of method for detecting human face and device based on dual camera
CN111292278A (en) * 2019-07-30 2020-06-16 展讯通信(上海)有限公司 Image fusion method and device, storage medium and terminal
CN111353361A (en) * 2019-08-19 2020-06-30 深圳市鸿合创新信息技术有限责任公司 Face recognition method and device and electronic equipment

Also Published As

Publication number Publication date
CN112308018A (en) 2021-02-02

Similar Documents

Publication Publication Date Title
WO2022105027A1 (en) Image recognition method and system, electronic device, and storage medium
CN106650662B (en) Target object shielding detection method and device
CN110232369B (en) Face recognition method and electronic equipment
US12051258B2 (en) Image processing to detect a rectangular object
WO2018049998A1 (en) Traffic sign information acquisition method and device
CN109166156B (en) Camera calibration image generation method, mobile terminal and storage medium
JP2017520050A (en) Local adaptive histogram flattening
CN110062157B (en) Method and device for rendering image, electronic equipment and computer readable storage medium
CN108509994B (en) Method and device for clustering character images
CN112085775B (en) Image processing method, device, terminal and storage medium
WO2015039575A1 (en) Method and system for performing image identification
CN108229494B (en) Network training method, processing method, device, storage medium and electronic equipment
US20230177680A1 (en) Assay reading method
CN111832579B (en) Map interest point data processing method and device, electronic equipment and readable medium
CN113228622A (en) Image acquisition method, image acquisition device and storage medium
JP6374849B2 (en) User terminal, color correction system, and color correction method
CN111126159A (en) Method, apparatus, electronic device, and medium for tracking pedestrian in real time
WO2021233058A1 (en) Method for monitoring articles on shop shelf, computer and system
CN110490022A (en) A kind of bar code method and device in identification picture
CN112270305A (en) Card image recognition method and device and electronic equipment
CN111666884A (en) Living body detection method, living body detection device, computer-readable medium, and electronic apparatus
CN110827254A (en) Method and device for determining image definition
CN113508391B (en) Data processing method, device and system, medium and computer equipment
CN115620005A (en) Image identification method, device, equipment and storage medium
CN113628192A (en) Image blur detection method, device, apparatus, storage medium, and program product

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20962318

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20962318

Country of ref document: EP

Kind code of ref document: A1