WO2019051814A1 - Target recognition method and apparatus, and intelligent terminal - Google Patents

Target recognition method and apparatus, and intelligent terminal Download PDF

Info

Publication number
WO2019051814A1
WO2019051814A1 PCT/CN2017/101967 CN2017101967W WO2019051814A1 WO 2019051814 A1 WO2019051814 A1 WO 2019051814A1 CN 2017101967 W CN2017101967 W CN 2017101967W WO 2019051814 A1 WO2019051814 A1 WO 2019051814A1
Authority
WO
WIPO (PCT)
Prior art keywords
tested
result
recognition
confidence
information
Prior art date
Application number
PCT/CN2017/101967
Other languages
French (fr)
Chinese (zh)
Inventor
廉士国
刘兆祥
王宁
Original Assignee
达闼科技(北京)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 达闼科技(北京)有限公司 filed Critical 达闼科技(北京)有限公司
Priority to CN201780002292.4A priority Critical patent/CN107995982B/en
Priority to PCT/CN2017/101967 priority patent/WO2019051814A1/en
Publication of WO2019051814A1 publication Critical patent/WO2019051814A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/94Hardware or software architectures specially adapted for image or video understanding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection

Definitions

  • the embodiments of the present invention relate to the field of intelligent identification technologies, and in particular, to a target identification method, apparatus, and intelligent terminal.
  • the environment is variable, and the recognition capability of the smart terminal is limited.
  • the smart terminal may not be able to accurately identify the target to be tested, for example, due to light, angle, or occlusion. Identify who this person is, and for example, because of the distance and angle, you can't be sure of the brand or model of the car.
  • the intelligent terminal if it is required to output a higher-level recognition result, it may cause embarrassment due to the recognition error; or, if the output result is discarded because the detailed result is not obtained, Not conducive to a user-friendly experience.
  • the embodiment of the present application provides a target recognition method, device, and intelligent terminal, which can solve the problem of how to achieve a compromise between reliability and detail level of target recognition.
  • an embodiment of the present application provides a target identification method, which is applied to an intelligent terminal, and includes include:
  • Collecting information for the object to be tested includes at least two types of attributes, and a priority relationship is set between the at least two types of attributes;
  • the recognition result of the object to be tested is a determination result corresponding to one of the attribute types, the confidence of the determination result satisfies a preset condition, and the recognition result corresponds to
  • the attribute type has the highest priority among the attribute types corresponding to the determination result that the confidence degree satisfies the preset condition.
  • an embodiment of the present application provides a target identification apparatus, including:
  • An information collecting unit configured to collect information about a target to be tested, where the object to be tested includes at least two types of attributes, and a priority relationship is set between the at least two types of attributes;
  • a recognition unit configured to output a recognition result of the object to be tested based on the information, where the recognition result is a determination result corresponding to one of the attribute types, the confidence of the determination result satisfies a preset condition, and The attribute type corresponding to the recognition result has the highest priority among the attribute types corresponding to the determination result that the confidence degree satisfies the preset condition.
  • an intelligent terminal including:
  • At least one processor and,
  • the memory stores instructions executable by the at least one processor, the instructions being executed by the at least one processor to enable the at least one processor to perform the target recognition method as described above.
  • an embodiment of the present application provides a non-transitory computer readable storage medium, where the non-transitory computer readable storage medium stores computer executable instructions for causing a smart terminal to execute the above The target recognition method.
  • the embodiment of the present application further provides a computer program product, the computer program product comprising a computer program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions, when the program When the instruction is executed by the smart terminal, causing the smart terminal to execute as above The target recognition method.
  • the target identification method, apparatus, and intelligent terminal provided by the embodiments of the present application divide multiple attributes with priority order for the attributes of the target to be tested according to the degree of detail of the description of the object to be tested.
  • Type and in the process of identification, obtain the confidence of the judgment result under each attribute type, and then output the judgment result corresponding to the attribute type with the highest priority among the judgment results satisfying the preset condition according to the actual recognition situation.
  • the recognition result of the target to be measured can ensure the reliability of the output recognition result under different recognition scenarios, and at the same time, output a more detailed recognition result as much as possible, that is, the final recognition result can be reliability and A compromise between levels of detail improves the user experience.
  • FIG. 1 is a schematic flowchart diagram of a target recognition method according to an embodiment of the present application
  • FIG. 2 is a schematic flow chart of another object recognition method provided by an embodiment of the present application.
  • FIG. 3 is a schematic structural diagram of a target recognition apparatus according to an embodiment of the present application.
  • FIG. 4 is a schematic structural diagram of hardware of an intelligent terminal according to an embodiment of the present application.
  • the embodiment of the present application provides a target recognition method, device, and intelligent terminal, which can be applied to any application field related to target recognition, such as: intelligent guide blind, welcome robot, service robot, intrusion object detection, semantic recognition, and the like.
  • the target recognition method provided by the embodiment of the present application is an intelligent optimization identification method based on the “priority” of the attribute type of the target to be measured and the “confidence” of the determination result under each attribute type, according to the
  • the degree of detail of the description of the target is divided into a plurality of attribute types having priority order for the attribute of the target to be tested (where the higher the priority type, the higher the degree of detail corresponding to the judgment result), and in the process of identification
  • a confidence level for evaluating the reliability of the judgment result is set, and then the judgment corresponding to the attribute type having the highest priority among the judgment results satisfying the preset condition is output according to the actual recognition situation.
  • the target recognition method and device and the intelligent terminal provided by the embodiment of the present application identify the same person/object (the object to be tested), different levels of recognition results may be output under different recognition environments. For example, in the case of identifying a person, when the light is good, the distance is close, and the person being tested is facing the machine camera, the "person name" of the person to be tested can be identified; when the person being tested covers half of the face by hand, or sideways When the machine camera is used, only the "gender” of the person being tested can be identified; when the person being tested is facing the camera of the machine, only whether it is a "person” can be identified.
  • the object recognition method and apparatus provided by the embodiments of the present application can be applied to any type of smart terminal, such as a robot, a guide glasses, a smart helmet, a smart phone, a tablet computer, a server, and the like.
  • the smart terminal can include any suitable type of storage medium for storing data, such as a magnetic disk, a compact disc (CD-ROM), a read-only memory or a random access memory.
  • the smart terminal may also include one or more logical computing modules that perform any suitable type of function or operation in parallel, such as viewing a database, image processing, etc., in a single thread or multiple threads.
  • the logic operation module may be any suitable type of electronic circuit or chip-type electronic device capable of performing logical operation operations, such as a single core processor, a multi-core processor, a graphics processing unit (GPU), or the like.
  • FIG. 1 is a schematic flowchart of a target identification method according to an embodiment of the present application. Referring to FIG. 1, the method includes but is not limited to:
  • the target to be tested may include, but is not limited to, a person, an animal, an object, and the like. According to the degree of detail of the description of the object to be measured, at least two different levels of attribute types may be divided for the object to be tested, and priority relationships are set for the attribute types according to the level of detail.
  • the attribute type with difficulty in identifying is relatively high in detail, and the difficulty level of recognition can be sorted according to the recognition rate of different attribute types under the same conditions (for example, inputting the same picture) (for example)
  • person name recognition is difficult to identify by gender, gender recognition is difficult for face/human recognition); or, it can be sorted according to the mutual inclusion relationship between attribute types (for example, to identify the gender, the presence of the face must be recognized first).
  • the attribute type of the person may be set according to the degree of detail of the description of the object to be tested, including: "person name”, “gender”, and “whether it is a person”, and according to the difficulty level of recognition, You can set the priority order of these attribute types to be: L1 (person name) > L2 (gender) > L3 (whether it is human).
  • the attribute types of the vehicle can be set to include: “a license plate”, “a model of the vehicle”, “the color of the vehicle”, and “whether or not the vehicle”, and according to the difficulty level of the identification, Set the priority order of these attribute types to: L1 (vehicle license plate) > L2 (model of the car) > L3 (color of the car) > L4 (whether it is a car).
  • the “information” may be any judgment basis that can reflect the attribute of the object to be tested, and the type of the information may include, but is not limited to, image information, sound information, thermal infrared image, near-infrared image. , ultrasonic signals, electromagnetic reflection signals, etc.
  • information about the object to be tested may be collected by one or more sensors, for example, collecting image information for the object to be tested through the camera, collecting sound information for the object to be tested through the microphone, and passing the thermal infrared.
  • the sensor collects a thermal infrared image or the like for the target to be tested.
  • each attribute type of the object to be tested corresponds to a determination result
  • each determination result corresponds to a reliable one for characterizing the determination result.
  • Confidence in sexuality or credibility.
  • the judgment results obtained by the target to be tested include: “Zhang San” (confidence is 70%), “male” (confidence is 89%), “person” ( The confidence level is 100%), then, “Zhang San”, “Men” and “People” are the judgment results corresponding to the attribute types "person name", "gender” and "whether or not” of the object to be tested.
  • the confidence level of the judgment result can be determined by the similarity degree of the feature comparison, and the higher the degree of similarity, the higher the confidence degree.
  • the outputting result is a determination result corresponding to one of the attribute types of the object to be tested, the confidence of the determination result satisfies a preset condition, and the attribute type corresponding to the recognition result is in confidence.
  • the priority of the attribute type corresponding to the judgment result satisfying the preset condition is the highest.
  • the “preset condition” may be set according to an actual application scenario, and used to identify the reliability of a certain judgment result.
  • the preset condition may be: a confidence level of the determination result is greater than or equal to a confidence threshold corresponding to the corresponding attribute type.
  • the confidence threshold corresponding to each attribute type may be the same.
  • the confidence thresholds corresponding to the attribute types “person name”, “gender”, and “whether or not” are both 70%, and the judgment result of the target to be tested is obtained. Including: “Zhang San” (confidence is 70%), “male” (confidence is 89%), "person” (confidence is 100%), then, the judgment results “Zhang San", "male” and " The confidence degree of the person meets the preset condition.
  • the recognition result of the object to be tested is the judgment result "Zhang San” corresponding to the attribute type "person name” having the highest priority among the three.
  • the confidence threshold corresponding to each attribute type may also be different.
  • the confidence threshold corresponding to the attribute type “person name” may be preset to be 75%, corresponding to the attribute type “gender”. The confidence threshold is 85%, and the confidence threshold corresponding to the attribute type “is it human” is 95%. If the target to be tested is the same, “Zhang San” (confidence is 70%), “male” ( Confidence is 89%), "People” (confidence is 100%), then the judgment result that the confidence meets the preset condition includes only "male” and "person”. At this time, the recognition result of the target to be tested is The highest priority attribute type "gender” corresponds to the judgment result "male”.
  • the identification result of the target to be tested is output based on the collected information.
  • Embodiments may include, but are not limited to, the following two embodiments:
  • the determination result corresponding to each attribute type of the object to be tested and the confidence of each determination result may be firstly obtained based on the collected information; and then the priority of the determination result that the confidence degree satisfies the preset condition is output.
  • the judgment result corresponding to the highest attribute type is used as the recognition result of the object to be tested.
  • the determination result corresponding to each attribute type of the object to be tested based on the collected information may be implemented by using a suitable algorithm (for example, a neural network). For example, if the target to be measured is a person and the information collected by the smart terminal is the image information of the person, the smart terminal may iteratively calculate the attribute type “whether it is a person”, “gender” and “person name” from the image.
  • a suitable algorithm for example, a neural network
  • the judgment result for example, firstly calculates the feature 1 for discriminating whether it is a person by the bottom layer of the neural network, and obtains the judgment result of "whether it is a person” according to the feature 1 and the confidence of the judgment result; then, in the neural network The middle layer calculates feature 2 for discriminating "gender” based on feature 1, and obtains the judgment result corresponding to "gender” according to feature 2 and the confidence of the judgment result; finally, based on feature 2 at the uppermost layer of the neural network
  • the feature 3 for discriminating the "person name” is obtained, and according to the feature 3, the judgment result corresponding to the "person name” and the confidence of the judgment result are obtained.
  • the judgment result that the confidence degree satisfies the preset requirement is first screened, and then the judgment of the target to be tested is selected to the highest degree of detail (that is, the corresponding attribute type has the highest priority). The result is the recognition result of the target to be tested.
  • the determination result corresponding to each attribute type of the target to be tested and the confidence of each determination result may be obtained step by step according to the priority from high to low, until the first
  • the determination result that the confidence degree satisfies the preset condition appears the determination result that the first confidence degree satisfies the preset condition is output as the recognition result of the object to be tested. That is, when the information for the object to be tested is collected, firstly, based on the collected information, the first-level judgment result corresponding to the attribute type with the highest priority and the first-level confidence level of the first-level judgment result are obtained, if the first-level confidence is satisfied.
  • the preset condition for example, the first-level confidence is greater than or equal to the first-level confidence threshold
  • the first-level judgment result is directly output as the recognition result of the target to be tested, otherwise, the attribute level corresponding to the next level is obtained based on the collected information.
  • the second-level judgment result and the second-level confidence of the second-level judgment result if the second-level confidence level satisfies a preset condition (for example, the second-level confidence level is greater than or equal to the second-level confidence threshold), the second-level judgment result is output As a result of the identification of the target to be tested, otherwise, the determination result corresponding to the attribute type of the next level and the confidence thereof are obtained based on the collected information, and the loop is repeated until the determination result that the confidence level satisfies the preset condition is obtained.
  • a preset condition for example, the second-level confidence level is greater than or equal to the second-level confidence threshold
  • different features may be extracted from the collected information for different levels of judgment. For example, if the target to be measured is a vehicle and the collected information is image information for the vehicle, The feature a can be extracted from the image information for identifying whether there is a car in the image, the feature b is extracted for identifying the color of the car in the image, and the feature c is extracted for identifying the type of the car (car, truck, bus, etc.) Wait.
  • the judgment result corresponding to each attribute type of the object to be tested and the confidence thereof are obtained step by step according to the order of priority from high to low, when the first confidence level satisfies the judgment result of the preset condition Directly outputting the judgment result that the first confidence degree satisfies the preset condition, without identifying and judging each attribute type, thereby reducing the amount of data processing, and improving the recognition without affecting the level of detail and reliability. effectiveness.
  • the target recognition method may further include: transmitting an interaction signal corresponding to the recognition result.
  • the smart glasses or smart helmet for guiding blind can send a voice prompt “Your friend Zhang San” to the user for welcoming or providing service.
  • the robot can say "Hello! VIP customer Zhang San!” to the target, and/or adjust the gestures exclusive to VIP customers.
  • the recognition result outputted in step 120 is “male”
  • the smart glasses or the smart helmet for guiding the blind can give the user a voice prompt “there is a man in front”, and the robot for welcoming or providing the service can be tested.
  • the goal is "Hello! Sir!.
  • the object recognition method provided by the embodiment of the present application is that the target identification method provided by the embodiment of the present application divides multiple attribute types with priority order for the attribute of the target to be tested according to the detailed level of the description of the object to be tested. And in the process of identification, get each attribute type The confidence of the result of the determination is determined, and then the judgment result corresponding to the attribute type having the highest priority among the judgment results satisfying the preset condition is output as the recognition result of the object to be tested according to the actual recognition situation, and can be in different recognition scenarios. The reliability of the recognition result of the output is ensured, and at the same time, the more detailed recognition result is output as much as possible, that is, the final recognition result can be compromised between reliability and detail level, thereby improving the user experience.
  • the second embodiment of the present application further provides another target identification method.
  • the collected information includes at least two types of information sources.
  • FIG. 2 includes but is not limited to:
  • the "information source” refers to a source of information capable of reflecting an attribute of a target to be tested.
  • the “at least two information sources” may be at least two different types of information, such as any two or more of image information, sound information, thermal infrared images, near infrared images, ultrasonic signals, or electromagnetic reflection signals;
  • the “at least two information sources” may also be some type of information collected from at least two angles or moments, for example, image information (or sound information) of the target to be measured is collected from multiple angles, and each Image information (or sound information) acquired from a single perspective can be used as a source of information.
  • the “at least two information sources” may also be a combination of the above two forms.
  • the information collected for the target to be tested includes image information collected from multiple angles and from one image. Sound information collected at an angle.
  • step 110 the specific implementation manner of collecting each information source may refer to step 110 in the foregoing embodiment 1, and will not be described in detail herein.
  • the recognition result of the target to be tested is obtained by means of multi-information fusion.
  • the specific implementation manner of outputting the recognition result of the object to be tested based on the collected at least two types of information sources may include, but is not limited to, the following three implementation manners:
  • the target to be tested may be identified by means of “sub-mode fusion”, that is, firstly, the sub-recognition results of the object to be tested are acquired based on the collected at least two information sources, and then the sub-identifications are determined according to the sub-identifications.
  • the result outputs the recognition result of the object to be tested.
  • the “sub-recognition result” refers to a recognition result obtained based on only one information source, and each information source corresponds to one sub-recognition result. Therefore, in this embodiment, the sub-recognition result also includes at least two, and each sub-recognition result has a corresponding confidence level for evaluating the reliability of the sub-recognition result.
  • the sub-recognition result corresponding to each information source may be separately obtained by the target identification method (shown in FIG. 1) provided in the first embodiment, and then selected from the sub-recognition results.
  • the most detailed sub-identification result is used as the recognition result of the object to be tested.
  • the level of detail of the sub-recognition result may be determined by the priority of the attribute type corresponding to the sub-recognition result, and the higher the priority of the corresponding attribute type, the higher the level of detail, for example, the obtained sub-recognition result includes “ “person” and "girl”, wherein the attribute type corresponding to the child recognition result "person” is "whether it is a person", the attribute type of the child recognition result "girl” is "gender”, and the priority of "gender” is higher than " If the person is a person, the sub-recognition result "girl” is higher in detail than the sub-recognition result "person”, so that the sub-recognition result "girl” can be used as the recognition result of the object to be tested.
  • the steps 110 to 120 in the first embodiment may be performed based on the collected image information; and the first embodiment is executed based on the collected sound information. Steps 110 to 120. It is assumed that the sub-recognition result output based on the acquired image information is “person”, and the sub-recognition result output based on the collected sound information is “Li Si”, and the sub-recognition result “Li Si” with higher level of detail can be output. "As the result of the identification of the target to be tested.
  • the sub-recognition result with the highest degree of confidence can be selected from the sub-identification result with the highest degree of detail as the recognition result of the object to be tested, for example, the confidence of the sub-recognition result “boy” is 70%, and the sub-recognition result “ The confidence of the girl is 90% (>70%), so that the sub-recognition result “girl” can be selected as the target of the object to be tested. result.
  • the level of detail of the target recognition can be further improved.
  • the target to be tested may be identified by means of a hierarchical decision fusion, that is, the test is obtained step by step according to the priority from the highest to the lowest based on the at least two information sources.
  • the sub-judgment result of the condition is used as the recognition result of the object to be tested.
  • the “sub-judgment result” refers to a judgment result of the object to be tested under a certain attribute type based on only one information source analysis, and each information source corresponds to one sub-judgment result under each attribute type. .
  • the “sub-confidence” refers to the degree of credibility of the sub-judgment result and is used to characterize the reliability of the sub-judgment result.
  • the target to be tested is a person
  • the attribute types include: “person name”, “gender” and “whether it is a person”
  • the priority relationship is: L1 (person name)>L2 (gender)>L3 (whether it is a person)
  • collection The obtained information includes image information and sound information.
  • the sub-judgment result corresponding to the “person name” and the confidence thereof are first obtained based on the image information and the sound information, respectively, and it is assumed that the sub-judgment result obtained based on the image information is “Li Si”.
  • the sub-judgment result "Li Si” is the sub-judgment result that the first sub-confidence satisfies the preset condition (meeting the first preset condition or the second preset condition).
  • the sub-judgment result with the highest confidence among the sub-judgment results may be selected as the recognition result of the target to be tested.
  • the target to be tested is hierarchically identified based on at least two information sources, respectively, as long as one of the information sources is analyzed to obtain an optimal recognition result (ie, the most reliable and detailed identification node). Therefore, the optimal recognition result can be directly output, and the efficiency of target recognition can be improved.
  • the target to be tested may be identified by using a “hierarchical fusion decision”, that is, the to-be-selected from the at least two information sources is sequentially stepped according to the order of priority from high to low. Measure the feature corresponding to each attribute type of the target, and obtain the judgment result corresponding to each attribute type and the confidence of the judgment result according to the feature corresponding to each attribute type; until the first confidence level satisfies the determination result of the preset condition When it appears, the judgment result that the first confidence level satisfies the preset condition is output as the recognition result of the object to be tested.
  • a “hierarchical fusion decision” that is, the to-be-selected from the at least two information sources is sequentially stepped according to the order of priority from high to low.
  • the target to be tested is a person
  • the attribute types include: “person name”, “gender” and “whether it is a person”, and the priority relationship is: L1 (person name)>L2 (gender)>L3 (whether it is a person), collection
  • the obtained information includes image information and sound information
  • the first-level feature A1 for identifying the "person name” may be first extracted from the collected image information, and the "person name” is extracted from the sound information.
  • the first-level feature A2 and then the two types of first-level features A1 and A2 are merged together (for example, A1 and A2 are spliced together by a neural network separator that combines two types of features) to generate feature A, and then identify according to feature A.
  • the first-level judgment result of the person name and the first-level confidence of the first-level judgment result if the first-level confidence level satisfies the first-level preset condition, the first-level judgment result is output; otherwise, the collected image information is extracted A secondary feature B1 for identifying "gender", and a secondary feature B2 for identifying "gender” are extracted from the sound information, and then the two types of secondary features B1 and B2 are fused together to generate feature B.
  • the second-level judgment result of “gender” and the second-level confidence of the second-level judgment result are recognized, and if the second-level confidence level satisfies the second-level preset condition, the second-level judgment result is output; otherwise, continue The judgment result corresponding to the attribute type of the next level and the confidence thereof are obtained, and the loop is obtained until the judgment result that the confidence degree satisfies the preset condition is obtained.
  • the determination information of the target recognition can be enriched, which not only can improve the detail level of the target recognition, but also improve the efficiency of the target recognition.
  • the object recognition method provided by the embodiment of the present application can improve the target identification method provided by the embodiment of the present application by collecting at least two types of information sources and outputting the recognition result of the object to be tested according to the at least two information sources.
  • the level of detail and efficiency of target recognition can improve the target identification method provided by the embodiment of the present application by collecting at least two types of information sources and outputting the recognition result of the object to be tested according to the at least two information sources.
  • FIG. 3 is a schematic structural diagram of a target recognition apparatus according to an embodiment of the present application.
  • the target identification apparatus 3 includes:
  • the information collecting unit 31 is configured to collect information about a target to be tested, where the object to be tested includes at least two types of attributes, and a priority relationship is set between the at least two types of attributes;
  • the identification unit 32 is configured to output a recognition result of the object to be tested based on the information, where the recognition result is a determination result corresponding to one of the attribute types, the confidence of the determination result satisfies a preset condition, and The attribute type corresponding to the recognition result has the highest priority among the attribute types corresponding to the determination result that the confidence degree satisfies the preset condition.
  • the recognition unit 32 when the information collecting unit 31 collects information for the object to be tested, the recognition unit 32 outputs the recognition result of the object to be tested based on the information.
  • the object to be tested includes at least two attribute types, and a priority relationship is set between the at least two attribute types; the recognition result is a determination result corresponding to one of the attribute types, and the determination result is The confidence level satisfies the preset condition, and the attribute type corresponding to the recognition result has the highest priority among the attribute types corresponding to the determination result that the confidence degree satisfies the preset condition.
  • the identifying unit 32 is specifically configured to: obtain, according to the information, a determination result corresponding to each attribute type of the object to be tested and a confidence level of each determination result; and output confidence that the preset condition meets the preset condition
  • the judgment result corresponding to the attribute type having the highest priority among the judgment results is used as the recognition result of the object to be tested.
  • the identification unit 32 includes an analysis module 321 and an output module 322.
  • the analysis module 321 is configured to acquire the information according to the target to be tested according to the priority from high to low. a determination result corresponding to each attribute type of the object to be tested and a confidence level of each determination result; the output module 322 is configured to output the first confidence when the first confidence level meets the preset condition The judgment result that satisfies the preset condition is used as the recognition result of the object to be tested.
  • the analysis module 321 is specifically configured to: according to the order of priority from high to low, And extracting, from the at least two information sources, the features corresponding to each attribute type of the object to be tested, and obtaining the determination result corresponding to each attribute type and the determination result according to the feature corresponding to each attribute type Confidence.
  • the identifying unit 32 is specifically configured to: based on the at least two information sources respectively, according to the priority from high to low.
  • the sub-judging result corresponding to each attribute type of the object to be tested and the sub-confidence of each sub-judgment result are obtained step by step until the sub-judgment result of the first sub-confidence satisfying the preset condition appears, and the output is a sub-judgment result that satisfies a preset condition as a recognition result of the object to be tested; or, a sub-recognition result of the object to be tested is acquired based on the at least two information sources, wherein each An information source corresponds to a sub-recognition result; and the recognition result of the object to be tested is output according to the sub-recognition result.
  • the target recognition device 3 further includes:
  • the interaction unit 33 is configured to send an interaction signal corresponding to the recognition result.
  • the object recognition apparatus divides the attribute type of the priority order into the attribute of the object to be tested according to the detailed degree of the description of the object to be tested. And in the process of identification, the confidence unit obtains the confidence of the judgment result under each attribute type, and then outputs the judgment result corresponding to the attribute type with the highest priority among the judgment results satisfying the preset condition according to the actual recognition situation.
  • the recognition result of the object to be tested it is possible to ensure the reliability of the output recognition result under different recognition scenarios, and at the same time, output a more detailed recognition result as much as possible, that is, the final recognition result can be reliably A compromise between sex and level of detail improves the user experience.
  • the 400 can be any type of smart terminal, such as a robot, a blind eyeglass, a smart helmet, a smart phone, a tablet, a server, etc., and can perform the target recognition method provided by the first embodiment and/or the second embodiment.
  • the smart terminal 400 includes:
  • One or more processors 401 and memory 402 are exemplified by one processor 401 in FIG.
  • the processor 401 and the memory 402 can be connected by a bus or other means, and the connection by a bus is taken as an example in FIG.
  • the memory 402 is used as a non-transitory computer readable storage medium, and can be used for storing a non-transitory software program, a non-transitory computer executable program, and a module, such as a program instruction/module corresponding to the target recognition method in the embodiment of the present application.
  • a module such as a program instruction/module corresponding to the target recognition method in the embodiment of the present application.
  • the processor 401 executes various functional applications and data processing of the target recognition device by executing non-transitory software programs, instructions, and modules stored in the memory 402, that is, implementing the target recognition method of any of the above method embodiments.
  • the memory 402 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created according to usage of the target identification device, and the like.
  • memory 402 can include high speed random access memory, and can also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid state storage device.
  • memory 402 can optionally include memory remotely located relative to processor 401, which can be connected to smart terminal 400 over a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • the one or more modules are stored in the memory 402, and when executed by the one or more processors 401, perform a target recognition method in any of the above method embodiments, for example, performing the above described FIG. Method step 110 to step 120, method step 210 to step 220 in FIG. 2, implement the functions of units 31-33 in FIG.
  • the embodiment of the present application further provides a non-transitory computer readable storage medium, the non-transient computing
  • the machine readable storage medium stores computer executable instructions that are executed by one or more processors, such as by one of the processors 401 of FIG. 4, such that the one or more processors perform any of the above
  • the object recognition method in the method embodiment for example, performs the method steps 110 to 120 in FIG. 1 described above, and the method steps 210 to 220 in FIG. 2 implement the functions of the units 31-33 in FIG.
  • the device embodiments described above are merely illustrative, wherein the units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, ie may be located A place, or it can be distributed to multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • the various embodiments can be implemented by means of software plus a general hardware platform, and of course, by hardware.
  • a person skilled in the art can understand that all or part of the process of implementing the above embodiments can be completed by a computer program to instruct related hardware, and the program can be stored in a non-transitory computer readable storage medium.
  • the program when executed, may include the flow of an embodiment of the methods as described above.
  • the storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).
  • the above product can perform the target recognition method provided by the embodiment of the present application, and has the corresponding functional modules and beneficial effects of performing the target recognition method.
  • the object recognition method provided by the embodiment of the present application.

Abstract

A target recognition method and apparatus, and an intelligent terminal. The method comprises: collecting information about a target to be detected (110), wherein the target to be detected comprises at least two attribute types, and a priority relationship is set between the at least two attribute types; and outputting, based on the information, a recognition result of the target to be detected (120), wherein the recognition result is a determination result corresponding to one of the attribute types, the degree of confidence of the determination result satisfies a pre-set condition, and the attribute type corresponding to the recognition result has the highest priority level among the attribute types corresponding to the determination result, the degree of confidence of which satisfies the pre-set condition. The method can ensure the reliability of the output recognition result under different recognition scenarios, and also output more detailed recognition results as far as possible, thereby improving the user experience.

Description

一种目标识别方法、装置和智能终端Target recognition method, device and intelligent terminal 技术领域Technical field
本申请实施例涉及智能识别技术领域,尤其涉及一种目标识别方法、装置和智能终端。The embodiments of the present invention relate to the field of intelligent identification technologies, and in particular, to a target identification method, apparatus, and intelligent terminal.
背景技术Background technique
随着机器智能化进程的推进,人与智能终端之间的交互越来越频繁,人机交互的自然体验问题也随之变得越来越重要。其中,影响人机交互的自然体验的一个重要因素就是智能终端对待测目标的识别的详细程度和可靠性。With the advancement of the machine intelligence process, the interaction between people and intelligent terminals becomes more and more frequent, and the natural experience of human-computer interaction becomes more and more important. Among them, an important factor affecting the natural experience of human-computer interaction is the level of detail and reliability of the identification of the target to be measured by the intelligent terminal.
当前,大多数智能终端都被希望能够输出人名、车的型号(或者系列)、车牌号码、猫的品种等详细程度较高的目标识别结果,以提升人机交互体验。Currently, most smart terminals are expected to output a higher-level target recognition result such as a person's name, model (or series) of the car, license plate number, and cat breed to enhance the human-computer interaction experience.
然而,在实际场景中,环境是多变的,而智能终端的识别能力是有限的,在某些场景下智能终端有可能无法精确识别待测目标,比如,因为光线、角度或者遮挡等原因无法识别出这个人是谁,又如,因为距离和角度的原因不能确信车的品牌或型号。在这种情况下,如果强制要求智能终端输出详细程度较高的识别结果,有可能会因为识别错误而带来尴尬;或者,如果因为得不到详细程度高的识别结果而放弃输出结果,也不利于用户友好体验。However, in an actual scenario, the environment is variable, and the recognition capability of the smart terminal is limited. In some scenarios, the smart terminal may not be able to accurately identify the target to be tested, for example, due to light, angle, or occlusion. Identify who this person is, and for example, because of the distance and angle, you can't be sure of the brand or model of the car. In this case, if the intelligent terminal is required to output a higher-level recognition result, it may cause embarrassment due to the recognition error; or, if the output result is discarded because the detailed result is not obtained, Not conducive to a user-friendly experience.
由此,如何在目标识别的可靠性与详细程度之间达到折中是现有的智能识别技术亟待解决的问题。Therefore, how to achieve a compromise between the reliability and the level of detail of the target recognition is an urgent problem to be solved by the existing intelligent identification technology.
发明内容Summary of the invention
本申请实施例提供一种目标识别方法、装置和智能终端,能够解决如何在目标识别的可靠性与详细程度之间达到折中的问题。The embodiment of the present application provides a target recognition method, device, and intelligent terminal, which can solve the problem of how to achieve a compromise between reliability and detail level of target recognition.
第一方面,本申请实施例提供了一种目标识别方法,应用于智能终端,包 括:In a first aspect, an embodiment of the present application provides a target identification method, which is applied to an intelligent terminal, and includes include:
采集针对待测目标的信息,所述待测目标包括至少两种属性类型,所述至少两种属性类型之间设置有优先级关系;Collecting information for the object to be tested, the object to be tested includes at least two types of attributes, and a priority relationship is set between the at least two types of attributes;
基于所述信息输出所述待测目标的识别结果,所述识别结果为其中一种所述属性类型对应的判断结果,所述判断结果的置信度满足预设条件,并且,所述识别结果对应的属性类型在置信度满足所述预设条件的判断结果对应的属性类型中优先级最高。And outputting, according to the information, the recognition result of the object to be tested, the recognition result is a determination result corresponding to one of the attribute types, the confidence of the determination result satisfies a preset condition, and the recognition result corresponds to The attribute type has the highest priority among the attribute types corresponding to the determination result that the confidence degree satisfies the preset condition.
第二方面,本申请实施例提供一种目标识别装置,包括:In a second aspect, an embodiment of the present application provides a target identification apparatus, including:
信息采集单元,用于采集针对待测目标的信息,所述待测目标包括至少两种属性类型,所述至少两种属性类型之间设置有优先级关系;An information collecting unit, configured to collect information about a target to be tested, where the object to be tested includes at least two types of attributes, and a priority relationship is set between the at least two types of attributes;
识别单元,用于基于所述信息输出所述待测目标的识别结果,所述识别结果为其中一种所述属性类型对应的判断结果,所述判断结果的置信度满足预设条件,并且,所述识别结果对应的属性类型在置信度满足所述预设条件的判断结果对应的属性类型中优先级最高。a recognition unit, configured to output a recognition result of the object to be tested based on the information, where the recognition result is a determination result corresponding to one of the attribute types, the confidence of the determination result satisfies a preset condition, and The attribute type corresponding to the recognition result has the highest priority among the attribute types corresponding to the determination result that the confidence degree satisfies the preset condition.
第三方面,本申请实施例提供一种智能终端,包括:In a third aspect, an embodiment of the present application provides an intelligent terminal, including:
至少一个处理器;以及,At least one processor; and,
与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein
所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行如上所述的目标识别方法。The memory stores instructions executable by the at least one processor, the instructions being executed by the at least one processor to enable the at least one processor to perform the target recognition method as described above.
第四方面,本申请实施例提供一种非暂态计算机可读存储介质,所述非暂态计算机可读存储介质存储有计算机可执行指令,所述计算机可执行指令用于使智能终端执行如上所述的目标识别方法。In a fourth aspect, an embodiment of the present application provides a non-transitory computer readable storage medium, where the non-transitory computer readable storage medium stores computer executable instructions for causing a smart terminal to execute the above The target recognition method.
第五方面,本申请实施例还提供了一种计算机程序产品,所述计算机程序产品包括存储在非暂态计算机可读存储介质上的计算机程序,所述计算机程序包括程序指令,当所述程序指令被智能终端执行时,使所述智能终端执行如上 所述的目标识别方法。In a fifth aspect, the embodiment of the present application further provides a computer program product, the computer program product comprising a computer program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions, when the program When the instruction is executed by the smart terminal, causing the smart terminal to execute as above The target recognition method.
本申请实施例的有益效果在于:本申请实施例提供的目标识别方法、装置和智能终端通过根据对待测目标的描述的详细程度的不同为待测目标的属性划分多个具有优先级顺序的属性类型,并且在识别的过程中,获取每一属性类型下的判断结果的置信度,继而根据实际识别情况输出置信度满足预设条件的判断结果中优先级最高的属性类型对应的判断结果作为所述待测目标的识别结果,能够在不同的识别场景下,确保输出的识别结果的可靠性,同时,尽可能地输出更详细的识别结果,即,使得最终得到的识别结果能够在可靠性和详细程度之间达到折中,从而提升用户体验。The beneficial effects of the embodiments of the present application are as follows: the target identification method, apparatus, and intelligent terminal provided by the embodiments of the present application divide multiple attributes with priority order for the attributes of the target to be tested according to the degree of detail of the description of the object to be tested. Type, and in the process of identification, obtain the confidence of the judgment result under each attribute type, and then output the judgment result corresponding to the attribute type with the highest priority among the judgment results satisfying the preset condition according to the actual recognition situation. The recognition result of the target to be measured can ensure the reliability of the output recognition result under different recognition scenarios, and at the same time, output a more detailed recognition result as much as possible, that is, the final recognition result can be reliability and A compromise between levels of detail improves the user experience.
附图说明DRAWINGS
一个或多个实施例通过与之对应的附图中的图片进行示例性说明,这些示例性说明并不构成对实施例的限定,附图中具有相同参考数字标号的元件表示为类似的元件,除非有特别申明,附图中的图不构成比例限制。The one or more embodiments are exemplified by the accompanying drawings in the accompanying drawings, and FIG. The figures in the drawings do not constitute a scale limitation unless otherwise stated.
图1是本申请实施例提供的一种目标识别方法的流程示意图;FIG. 1 is a schematic flowchart diagram of a target recognition method according to an embodiment of the present application;
图2是本申请实施例提供的另一种目标识别方法的流程示意图;2 is a schematic flow chart of another object recognition method provided by an embodiment of the present application;
图3是本申请实施例提供的一种目标识别装置的结构示意图;3 is a schematic structural diagram of a target recognition apparatus according to an embodiment of the present application;
图4是本申请实施例提供的一种智能终端的硬件结构示意图。FIG. 4 is a schematic structural diagram of hardware of an intelligent terminal according to an embodiment of the present application.
具体实施方式Detailed ways
为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处所描述的具体实施例仅用以解释本申请,并不用于限定本申请。In order to make the objects, technical solutions, and advantages of the present application more comprehensible, the present application will be further described in detail below with reference to the accompanying drawings and embodiments. It is understood that the specific embodiments described herein are merely illustrative of the application and are not intended to be limiting.
需要说明的是,如果不冲突,本申请实施例中的各个特征可以相互结合,均在本申请的保护范围之内。另外,虽然在装置示意图中进行了功能模块划分,在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于装置中的模块划分,或流程图中的顺序执行所示出或描述的步骤。 It should be noted that, if there is no conflict, the various features in the embodiments of the present application may be combined with each other, and are all within the protection scope of the present application. In addition, although the functional module partitioning is performed in the device schematic, the logical sequence is shown in the flowchart, but in some cases, the illustrated may be performed in a different manner from the modules in the device, or in the order in the flowchart. Or the steps described.
本申请实施例提供了一种目标识别方法、装置和智能终端,能够适用于任意与目标识别相关的应用领域,比如:智能导盲、迎宾机器人、服务机器人、入侵对象探测、语义识别等。The embodiment of the present application provides a target recognition method, device, and intelligent terminal, which can be applied to any application field related to target recognition, such as: intelligent guide blind, welcome robot, service robot, intrusion object detection, semantic recognition, and the like.
其中,本申请实施例提供的目标识别方法是一种基于待测目标的属性类型的“优先级”以及每一属性类型下的判断结果的“置信度”的智能优化识别方法,通过根据对待测目标的描述的详细程度的不同为待测目标的属性划分多个具有优先级顺序的属性类型(其中,优先级越高的属性类型对应的判断结果的详细程度越高),并且在识别的过程中,对每一属性类型下的判断结果设置用于评价该判断结果的可靠性的置信度,继而根据实际识别情况输出置信度满足预设条件的判断结果中优先级最高的属性类型对应的判断结果作为所述待测目标的识别结果,能够在不同的识别场景下,确保输出的识别结果的可靠性,同时,尽可能地输出更详细的识别结果,即,使得最终得到的识别结果能够在可靠性和详细程度之间达到折中,从而提升用户体验。The target recognition method provided by the embodiment of the present application is an intelligent optimization identification method based on the “priority” of the attribute type of the target to be measured and the “confidence” of the determination result under each attribute type, according to the The degree of detail of the description of the target is divided into a plurality of attribute types having priority order for the attribute of the target to be tested (where the higher the priority type, the higher the degree of detail corresponding to the judgment result), and in the process of identification In the judgment result under each attribute type, a confidence level for evaluating the reliability of the judgment result is set, and then the judgment corresponding to the attribute type having the highest priority among the judgment results satisfying the preset condition is output according to the actual recognition situation. As a result of the recognition of the object to be tested, it is possible to ensure the reliability of the output recognition result under different recognition scenarios, and at the same time, output a more detailed recognition result as much as possible, that is, the final recognition result can be obtained at A compromise between reliability and level of detail enhances the user experience.
由此,采用本申请实施例提供的目标识别方法、装置和智能终端识别相同的人/物(待测目标)时,在不同识别环境下可能输出不同详细程度的识别结果。例如:以识别人为例,在光照好,距离近,并且被测人正对机器摄像头时,可以识别出被测人的“人名”;当被测人用手遮住半张脸,或侧对机器摄像头时,只能识别出被测人的“性别”;当被测人背对机器摄像头时,只能识别出是否是个“人”。Therefore, when the target recognition method and device and the intelligent terminal provided by the embodiment of the present application identify the same person/object (the object to be tested), different levels of recognition results may be output under different recognition environments. For example, in the case of identifying a person, when the light is good, the distance is close, and the person being tested is facing the machine camera, the "person name" of the person to be tested can be identified; when the person being tested covers half of the face by hand, or sideways When the machine camera is used, only the "gender" of the person being tested can be identified; when the person being tested is facing the camera of the machine, only whether it is a "person" can be identified.
本申请实施例提供的目标识别方法和装置能够应用于任意类型的智能终端,比如:机器人、导盲眼镜、智能头盔、智能手机、平板电脑、服务器等。该智能终端可以包括任何合适类型的,用以存储数据的存储介质,例如磁碟、光盘(CD-ROM)、只读存储记忆体或随机存储记忆体等。该智能终端还可以包括一个或者多个逻辑运算模块,单线程或者多线程并行执行任何合适类型的功能或者操作,例如查看数据库、图像处理等。所述逻辑运算模块可以是任何合适类型的,能够执行逻辑运算操作的电子电路或者贴片式电子器件,例如:单核心处理器、多核心处理器、图形处理器(GPU)等。 The object recognition method and apparatus provided by the embodiments of the present application can be applied to any type of smart terminal, such as a robot, a guide glasses, a smart helmet, a smart phone, a tablet computer, a server, and the like. The smart terminal can include any suitable type of storage medium for storing data, such as a magnetic disk, a compact disc (CD-ROM), a read-only memory or a random access memory. The smart terminal may also include one or more logical computing modules that perform any suitable type of function or operation in parallel, such as viewing a database, image processing, etc., in a single thread or multiple threads. The logic operation module may be any suitable type of electronic circuit or chip-type electronic device capable of performing logical operation operations, such as a single core processor, a multi-core processor, a graphics processing unit (GPU), or the like.
具体地,下面结合附图,对本申请实施例作进一步阐述。Specifically, the embodiments of the present application are further described below in conjunction with the accompanying drawings.
实施例一Embodiment 1
图1是本申请实施例提供的一种目标识别方法的流程示意图,请参阅图1,该方法包括但不限于:1 is a schematic flowchart of a target identification method according to an embodiment of the present application. Referring to FIG. 1, the method includes but is not limited to:
110、采集针对待测目标的信息。110. Collect information about the target to be tested.
在本实施例中,待测目标可以包括但不限于:人、动物、物体等。根据对待测目标的描述的详细程度的不同可以为待测目标划分至少两个不同层级的属性类型,并且,按照所对应的详细程度的高低,为这些属性类型设置优先级关系。其中,可以认为识别难度较大的属性类型对应的详细程度较高,而识别的难易程度可以依据不同属性类型的识别算法在相同条件下(例如输入相同的图片)的识别率来排序(例如,通常人名识别难于性别识别,性别识别难于人脸/人体识别);或者,也可以依据属性类型间的相互包含关系来排序(例如,要识别性别需先识别到人脸的存在)。In this embodiment, the target to be tested may include, but is not limited to, a person, an animal, an object, and the like. According to the degree of detail of the description of the object to be measured, at least two different levels of attribute types may be divided for the object to be tested, and priority relationships are set for the attribute types according to the level of detail. Among them, it can be considered that the attribute type with difficulty in identifying is relatively high in detail, and the difficulty level of recognition can be sorted according to the recognition rate of different attribute types under the same conditions (for example, inputting the same picture) (for example) Usually, person name recognition is difficult to identify by gender, gender recognition is difficult for face/human recognition); or, it can be sorted according to the mutual inclusion relationship between attribute types (for example, to identify the gender, the presence of the face must be recognized first).
举例说明:假设待测目标为人,可以根据对待测目标的描述的详细程度的不同,设置人的属性类型包括:“人名”、“性别”以及“是否为人”,而根据识别的难易程度,可以设置这些属性类型的优先级顺序为:L1(人名)>L2(性别)>L3(是否为人)。又如,假设待测目标为车,则可以设置车的属性类型包括:“车牌”、“车的型号”、“车的颜色”以及“是否为车”,而根据识别的难易程度,可以设置这些属性类型的优先级顺序为:L1(车牌)>L2(车的型号)>L3(车的颜色)>L4(是否为车)。For example: assuming that the target to be tested is a person, the attribute type of the person may be set according to the degree of detail of the description of the object to be tested, including: "person name", "gender", and "whether it is a person", and according to the difficulty level of recognition, You can set the priority order of these attribute types to be: L1 (person name) > L2 (gender) > L3 (whether it is human). For another example, if the target to be tested is a vehicle, the attribute types of the vehicle can be set to include: “a license plate”, “a model of the vehicle”, “the color of the vehicle”, and “whether or not the vehicle”, and according to the difficulty level of the identification, Set the priority order of these attribute types to: L1 (vehicle license plate) > L2 (model of the car) > L3 (color of the car) > L4 (whether it is a car).
此外,在本实施例中,所述“信息”可以是任意能够反映待测目标的属性的判断依据,该信息的类型可以包括但不限于:图像信息、声音信息、热红外画面、近红外画面、超声信号、电磁反射信号等。In addition, in this embodiment, the “information” may be any judgment basis that can reflect the attribute of the object to be tested, and the type of the information may include, but is not limited to, image information, sound information, thermal infrared image, near-infrared image. , ultrasonic signals, electromagnetic reflection signals, etc.
在执行本步骤110时,可以通过一种或者多种传感器采集针对待测目标的信息,比如,通过摄像头采集针对待测目标的图像信息、通过麦克风采集针对待测目标的声音信息、通过热红外传感器采集针对待测目标的热红外画面等。When performing this step 110, information about the object to be tested may be collected by one or more sensors, for example, collecting image information for the object to be tested through the camera, collecting sound information for the object to be tested through the microphone, and passing the thermal infrared. The sensor collects a thermal infrared image or the like for the target to be tested.
120、基于所述信息输出所述待测目标的识别结果。 120. Output a recognition result of the object to be tested based on the information.
在本实施例中,在基于采集到的信息识别待测目标的过程中,待测目标每一属性类型下对应有一个判断结果,而每一判断结果对应有一个用于表征该判断结果的可靠性(或,可信性)的置信度。比如:基于采集到的针对某人的图像信息,获取到待测目标的判断结果包括:“张三”(置信度为70%),“男性”(置信度为89%),“人”(置信度为100%),则,“张三”、“男性”和“人”即分别为该待测目标的属性类型“人名”、“性别”和“是否为人”对应的判断结果。其中,判断结果的置信度可以通过特征比对的相似程度来确定,相似程度越高,置信度越高。In the embodiment, in the process of identifying the target to be tested based on the collected information, each attribute type of the object to be tested corresponds to a determination result, and each determination result corresponds to a reliable one for characterizing the determination result. Confidence in sexuality (or credibility). For example, based on the collected image information for a person, the judgment results obtained by the target to be tested include: “Zhang San” (confidence is 70%), “male” (confidence is 89%), “person” ( The confidence level is 100%), then, "Zhang San", "Men" and "People" are the judgment results corresponding to the attribute types "person name", "gender" and "whether or not" of the object to be tested. The confidence level of the judgment result can be determined by the similarity degree of the feature comparison, and the higher the degree of similarity, the higher the confidence degree.
特别地,在本实施例中,所输出的识别结果为待测目标其中一种属性类型对应的判断结果,该判断结果的置信度满足预设条件,并且,该识别结果对应的属性类型在置信度满足所述预设条件的判断结果对应的属性类型中优先级最高。In particular, in the embodiment, the outputting result is a determination result corresponding to one of the attribute types of the object to be tested, the confidence of the determination result satisfies a preset condition, and the attribute type corresponding to the recognition result is in confidence. The priority of the attribute type corresponding to the judgment result satisfying the preset condition is the highest.
其中,所述“预设条件”可以根据实际应用场景而设置,用于鉴定某一判断结果的可靠程度。具体地,该预设条件可以是:判断结果的置信度大于或者等于与其对应的属性类型所对应的置信阈值。其中,每一属性类型对应的置信阈值可以是相同的,比如,与属性类型“人名”、“性别”和“是否为人”对应的置信阈值均为70%,若获取到待测目标的判断结果包括:“张三”(置信度为70%),“男性”(置信度为89%),“人”(置信度为100%),则,判断结果“张三”、“男性”和“人”的置信度均满足预设条件,此时,该待测目标的识别结果为这三者中优先级最高的属性类型“人名”对应的判断结果“张三”。或者,在另一些实施例中,每一属性类型对应的置信阈值也可以是不相同的,比如,可以预设与属性类型“人名”对应的置信阈值为75%,与属性类型“性别”对应的置信阈值为85%,与属性类型“是否为人”对应的置信阈值为95%,若获取到待测目标的判断结果同样是:“张三”(置信度为70%),“男性”(置信度为89%),“人”(置信度为100%),则,置信度满足预设条件的判断结果仅包括“男性”和“人”,此时,该待测目标的识别结果为这两者中优先级最高的属性类型“性别”对应的判断结果“男性”。The “preset condition” may be set according to an actual application scenario, and used to identify the reliability of a certain judgment result. Specifically, the preset condition may be: a confidence level of the determination result is greater than or equal to a confidence threshold corresponding to the corresponding attribute type. The confidence threshold corresponding to each attribute type may be the same. For example, the confidence thresholds corresponding to the attribute types “person name”, “gender”, and “whether or not” are both 70%, and the judgment result of the target to be tested is obtained. Including: "Zhang San" (confidence is 70%), "male" (confidence is 89%), "person" (confidence is 100%), then, the judgment results "Zhang San", "male" and " The confidence degree of the person meets the preset condition. At this time, the recognition result of the object to be tested is the judgment result "Zhang San" corresponding to the attribute type "person name" having the highest priority among the three. Or, in other embodiments, the confidence threshold corresponding to each attribute type may also be different. For example, the confidence threshold corresponding to the attribute type “person name” may be preset to be 75%, corresponding to the attribute type “gender”. The confidence threshold is 85%, and the confidence threshold corresponding to the attribute type “is it human” is 95%. If the target to be tested is the same, “Zhang San” (confidence is 70%), “male” ( Confidence is 89%), "People" (confidence is 100%), then the judgment result that the confidence meets the preset condition includes only "male" and "person". At this time, the recognition result of the target to be tested is The highest priority attribute type "gender" corresponds to the judgment result "male".
具体地,在本实施例中,基于采集到的信息输出待测目标的识别结果的具 体实施方式可以包括但不限于以下两种实施方式:Specifically, in the embodiment, the identification result of the target to be tested is output based on the collected information. Embodiments may include, but are not limited to, the following two embodiments:
在第一种实施方式中,可以首先基于采集到的信息获取待测目标每一属性类型对应的判断结果以及每一判断结果的置信度;然后输出置信度满足预设条件的判断结果中优先级最高的属性类型对应的判断结果作为所述待测目标的识别结果。In the first implementation manner, the determination result corresponding to each attribute type of the object to be tested and the confidence of each determination result may be firstly obtained based on the collected information; and then the priority of the determination result that the confidence degree satisfies the preset condition is output. The judgment result corresponding to the highest attribute type is used as the recognition result of the object to be tested.
其中,在该实施例方式中,基于采集到的信息获取待测目标每一属性类型对应的判断结果可以通过使用合适的算法(比如,神经网络)来实现。比如,假设待测目标为人,智能终端采集到的信息为该人的图像信息,则,智能终端可以从该图像中迭代式地计算出属性类型“是否为人”、“性别”和“人名”对应的判断结果,比如,首先通过神经网络的底层计算出用于判别“是否为人”的特征1,并根据特征1得到“是否为人”对应判断结果及该判断结果的置信度;然后,在神经网络的中间层基于特征1计算用于判别“性别”的特征2,并根据特征2得到“性别”对应的判断结果及该判断结果的置信度;最后,在神经网络的最上层基于特征2计算出用于判别“人名”的特征3,并根据特征3得到“人名”对应的判断结果及该判断结果的置信度。当获取到所有判断结果及其置信度之后,首先筛选出置信度满足预设要求的判断结果,然后选择对待测目标的描述的详细程度最高(即,所对应的属性类型优先级最高)的判断结果作为待测目标的识别结果。In this embodiment manner, the determination result corresponding to each attribute type of the object to be tested based on the collected information may be implemented by using a suitable algorithm (for example, a neural network). For example, if the target to be measured is a person and the information collected by the smart terminal is the image information of the person, the smart terminal may iteratively calculate the attribute type “whether it is a person”, “gender” and “person name” from the image. The judgment result, for example, firstly calculates the feature 1 for discriminating whether it is a person by the bottom layer of the neural network, and obtains the judgment result of "whether it is a person" according to the feature 1 and the confidence of the judgment result; then, in the neural network The middle layer calculates feature 2 for discriminating "gender" based on feature 1, and obtains the judgment result corresponding to "gender" according to feature 2 and the confidence of the judgment result; finally, based on feature 2 at the uppermost layer of the neural network The feature 3 for discriminating the "person name" is obtained, and according to the feature 3, the judgment result corresponding to the "person name" and the confidence of the judgment result are obtained. After all the judgment results and their confidences are obtained, the judgment result that the confidence degree satisfies the preset requirement is first screened, and then the judgment of the target to be tested is selected to the highest degree of detail (that is, the corresponding attribute type has the highest priority). The result is the recognition result of the target to be tested.
在第二种实施方式中,可以基于采集到的信息,根据优先级从高到低的顺序逐级获取待测目标每一属性类型对应的判断结果以及每一判断结果的置信度,直至第一个置信度满足预设条件的判断结果出现时,输出该第一个置信度满足预设条件的判断结果作为所述待测目标的识别结果。即:当采集到针对待测目标的信息时,首先基于采集到的信息获取优先级最高的属性类型对应的一级判断结果以及一级判断结果的一级置信度,如果该一级置信度满足预设条件(比如,一级置信度大于或等于一级置信阈值),则直接输出该一级判断结果作为待测目标的识别结果,否则,基于采集到的信息获取下一级别的属性类型对应的二级判断结果以及二级判断结果的二级置信度;如果该二级置信度满足预设条件(比如,二级置信度大于或者等于二级置信阈值),则,输出该二级判断结果 作为待测目标的识别结果,否则,继续基于采集到的信息获取再下一级别的属性类型对应的判断结果及其置信度,如此循环,直至获取到置信度满足预设条件的判断结果。In the second implementation manner, based on the collected information, the determination result corresponding to each attribute type of the target to be tested and the confidence of each determination result may be obtained step by step according to the priority from high to low, until the first When the determination result that the confidence degree satisfies the preset condition appears, the determination result that the first confidence degree satisfies the preset condition is output as the recognition result of the object to be tested. That is, when the information for the object to be tested is collected, firstly, based on the collected information, the first-level judgment result corresponding to the attribute type with the highest priority and the first-level confidence level of the first-level judgment result are obtained, if the first-level confidence is satisfied. If the preset condition (for example, the first-level confidence is greater than or equal to the first-level confidence threshold), the first-level judgment result is directly output as the recognition result of the target to be tested, otherwise, the attribute level corresponding to the next level is obtained based on the collected information. The second-level judgment result and the second-level confidence of the second-level judgment result; if the second-level confidence level satisfies a preset condition (for example, the second-level confidence level is greater than or equal to the second-level confidence threshold), the second-level judgment result is output As a result of the identification of the target to be tested, otherwise, the determination result corresponding to the attribute type of the next level and the confidence thereof are obtained based on the collected information, and the loop is repeated until the determination result that the confidence level satisfies the preset condition is obtained.
其中,在该实施方式中,可以从采集到的信息中提取出不同的特征用于不同级别的判断,例如,假设待测目标为车,采集到的信息为针对该车的图像信息,则,可以从该图像信息中提取出特征a用于识别图像中是否有车,提取出特征b用于识别图像中车的颜色,提取特征c用于识别车的类型(轿车、货车、公交车等)等。In this embodiment, different features may be extracted from the collected information for different levels of judgment. For example, if the target to be measured is a vehicle and the collected information is image information for the vehicle, The feature a can be extracted from the image information for identifying whether there is a car in the image, the feature b is extracted for identifying the color of the car in the image, and the feature c is extracted for identifying the type of the car (car, truck, bus, etc.) Wait.
在该实施方式中,通过根据优先级从高到低的顺序逐级获取待测目标每一属性类型对应的判断结果及其置信度,当出现第一个置信度满足预设条件的判断结果时,就直接输出该第一个置信度满足预设条件的判断结果,而不需要对每一个属性类型进行识别判断,能够减少数据处理量,在不影响详细程度和可靠性的前提下,提升识别效率。In this embodiment, the judgment result corresponding to each attribute type of the object to be tested and the confidence thereof are obtained step by step according to the order of priority from high to low, when the first confidence level satisfies the judgment result of the preset condition Directly outputting the judgment result that the first confidence degree satisfies the preset condition, without identifying and judging each attribute type, thereby reducing the amount of data processing, and improving the recognition without affecting the level of detail and reliability. effectiveness.
此外,针对不同的应用场景以及应用需求,还可以在上述步骤110和120的基础上进行进一步的扩展。In addition, for different application scenarios and application requirements, further expansion can be performed on the basis of the above steps 110 and 120.
比如,在一些可以进行人机交互的应用场景中,如,智能导盲、迎宾机器人、服务机器人等,该目标识别方法还可以包括:发送与所述识别结果对应的交互信号。For example, in some application scenarios in which human-computer interaction is possible, such as smart guide blind, a welcome robot, a service robot, etc., the target recognition method may further include: transmitting an interaction signal corresponding to the recognition result.
举例说明:如果步骤120输出的识别结果为“张三”,用于导盲的智能眼镜或者智能头盔可以向用户发出语音提示“前面是你的朋友张三”,用于迎宾或者提供服务的机器人可以对待测目标说“您好!VIP客户张三!”,和/或,调整出VIP客户专属的手势。又如,如果步骤120输出的识别结果为“男性”,用于导盲的智能眼镜或者智能头盔可以向用户发出语音提示“前面有位男士”,用于迎宾或者提供服务的机器人可以对待测目标说“您好!先生!”。For example: if the recognition result outputted in step 120 is “Zhang San”, the smart glasses or smart helmet for guiding blind can send a voice prompt “Your friend Zhang San” to the user for welcoming or providing service. The robot can say "Hello! VIP customer Zhang San!" to the target, and/or adjust the gestures exclusive to VIP customers. For another example, if the recognition result outputted in step 120 is “male”, the smart glasses or the smart helmet for guiding the blind can give the user a voice prompt “there is a man in front”, and the robot for welcoming or providing the service can be tested. The goal is "Hello! Sir!".
通过上述技术方案可知,本申请实施例的有益效果在于:本申请实施例提供的目标识别方法通过根据对待测目标的描述的详细程度为待测目标的属性划分多个具有优先级顺序的属性类型,并且在识别的过程中,获取每一属性类型 下的判断结果的置信度,继而根据实际识别情况输出置信度满足预设条件的判断结果中优先级最高的属性类型对应的判断结果作为所述待测目标的识别结果,能够在不同的识别场景下,确保输出的识别结果的可靠性,同时,尽可能地输出更详细的识别结果,即,使得最终得到的识别结果能够在可靠性和详细程度之间达到折中,从而提升用户体验。According to the foregoing technical solution, the object recognition method provided by the embodiment of the present application is that the target identification method provided by the embodiment of the present application divides multiple attribute types with priority order for the attribute of the target to be tested according to the detailed level of the description of the object to be tested. And in the process of identification, get each attribute type The confidence of the result of the determination is determined, and then the judgment result corresponding to the attribute type having the highest priority among the judgment results satisfying the preset condition is output as the recognition result of the object to be tested according to the actual recognition situation, and can be in different recognition scenarios. The reliability of the recognition result of the output is ensured, and at the same time, the more detailed recognition result is output as much as possible, that is, the final recognition result can be compromised between reliability and detail level, thereby improving the user experience.
实施例二Embodiment 2
进一步地,为了提升目标识别的效率以及详细程度,本申请第二实施例还提供了另一种目标识别方法,在本实施例中,采集到的信息包括至少两种信息源。Further, in order to improve the efficiency and the level of detail of the target recognition, the second embodiment of the present application further provides another target identification method. In this embodiment, the collected information includes at least two types of information sources.
具体地,请参阅图2,该方法包括但不限于:Specifically, please refer to FIG. 2, which includes but is not limited to:
210、采集针对待测目标的至少两种信息源。210. Collect at least two information sources for the target to be tested.
在本实施例中,所述“信息源”是指能够反映待测目标的属性的信息来源。所述“至少两种信息源”可以是至少两种不同类型的信息,比如,图像信息、声音信息、热红外画面、近红外画面、超声信号或电磁反射信号中的任意两种或者多种;或者,所述“至少两种信息源”也可以是从至少两个角度或者时刻采集到的某一类型的信息,比如,从多个角度采集待测目标的图像信息(或者声音信息),每一视角采集到的图像信息(或者声音信息)均可作为一种信息源。当然,可以理解的是,所述“至少两种信息源”也可以是上述两种形式的组合,比如,针对待测目标采集到的信息中包括从多个角度采集到的图像信息和从一个角度采集到的声音信息。In the present embodiment, the "information source" refers to a source of information capable of reflecting an attribute of a target to be tested. The “at least two information sources” may be at least two different types of information, such as any two or more of image information, sound information, thermal infrared images, near infrared images, ultrasonic signals, or electromagnetic reflection signals; Alternatively, the “at least two information sources” may also be some type of information collected from at least two angles or moments, for example, image information (or sound information) of the target to be measured is collected from multiple angles, and each Image information (or sound information) acquired from a single perspective can be used as a source of information. Of course, it can be understood that the “at least two information sources” may also be a combination of the above two forms. For example, the information collected for the target to be tested includes image information collected from multiple angles and from one image. Sound information collected at an angle.
在本实施例中,采集每一信息源的具体实施方式可以参考上述实施例一中的步骤110,此处便不再详述。In this embodiment, the specific implementation manner of collecting each information source may refer to step 110 in the foregoing embodiment 1, and will not be described in detail herein.
220、基于所述至少两种信息源输出待测目标的识别结果。220. Output a recognition result of the target to be tested based on the at least two information sources.
在本实施例中,通过多信息融合的方式获取待测目标的识别结果。In this embodiment, the recognition result of the target to be tested is obtained by means of multi-information fusion.
具体地,在本实施例中,基于采集到的至少两种信息源输出待测目标的识别结果的具体实施方式可以包括但不限于以下三种实施方式: Specifically, in this embodiment, the specific implementation manner of outputting the recognition result of the object to be tested based on the collected at least two types of information sources may include, but is not limited to, the following three implementation manners:
在第一种实施方式中,可以采用“分模式融合”的方式识别待测目标,即:首先分别基于采集到的至少两种信息源获取该待测目标的子识别结果,然后根据这些子识别结果输出该待测目标的识别结果。其中,所述“子识别结果”是指仅基于一种信息源得到的识别结果,每一信息源对应一个子识别结果。由此,在该实施方式中,子识别结果也包括至少两个,每个子识别结果均有相应的置信度,用于评价该子识别结果的可靠性。In the first embodiment, the target to be tested may be identified by means of “sub-mode fusion”, that is, firstly, the sub-recognition results of the object to be tested are acquired based on the collected at least two information sources, and then the sub-identifications are determined according to the sub-identifications. The result outputs the recognition result of the object to be tested. The “sub-recognition result” refers to a recognition result obtained based on only one information source, and each information source corresponds to one sub-recognition result. Therefore, in this embodiment, the sub-recognition result also includes at least two, and each sub-recognition result has a corresponding confidence level for evaluating the reliability of the sub-recognition result.
具体地,在该实施例方式中,可以首先通过实施例一提供的目标识别方法(如图1所示)分别获取每一信息源对应的子识别结果,然后,从这些子识别结果中筛选出最详细的子识别结果作为该待测目标的识别结果。其中,子识别结果的详细程度可以通过子识别结果对应的属性类型的优先级来判定,所对应的属性类型的优先级越高,详细程度越高,比如,假设获取到的子识别结果包括“人”和“女生”,其中,子识别结果“人”对应的属性类型为“是否为人”,子识别结果“女生”对应的属性类型为“性别”,而“性别”的优先级高于“是否为人”,则,子识别结果“女生”的详细程度高于子识别结果“人”,从而可以以子识别结果“女生”作为该待测目标的识别结果。Specifically, in this embodiment manner, the sub-recognition result corresponding to each information source may be separately obtained by the target identification method (shown in FIG. 1) provided in the first embodiment, and then selected from the sub-recognition results. The most detailed sub-identification result is used as the recognition result of the object to be tested. The level of detail of the sub-recognition result may be determined by the priority of the attribute type corresponding to the sub-recognition result, and the higher the priority of the corresponding attribute type, the higher the level of detail, for example, the obtained sub-recognition result includes “ "person" and "girl", wherein the attribute type corresponding to the child recognition result "person" is "whether it is a person", the attribute type of the child recognition result "girl" is "gender", and the priority of "gender" is higher than " If the person is a person, the sub-recognition result "girl" is higher in detail than the sub-recognition result "person", so that the sub-recognition result "girl" can be used as the recognition result of the object to be tested.
举例说明:假设采集到的信息包括图像信息和声音信息,则,可以基于采集到的图像信息执行上述实施例一中的步骤110~120;同时,基于采集到的声音信息执行上述实施例一中的步骤110~120。假设基于采集到的图像信息输出的子识别结果为“人”,而基于采集到的声音信息输出的子识别结果为“李四”,则,可以输出详细程度更高的子识别结果“李四”作为待测目标的识别结果。For example, if the collected information includes the image information and the sound information, the steps 110 to 120 in the first embodiment may be performed based on the collected image information; and the first embodiment is executed based on the collected sound information. Steps 110 to 120. It is assumed that the sub-recognition result output based on the acquired image information is “person”, and the sub-recognition result output based on the collected sound information is “Li Si”, and the sub-recognition result “Li Si” with higher level of detail can be output. "As the result of the identification of the target to be tested.
此外,在实际的应用中,也有可能存在获取到的详细程度最高的子识别结果包括多个,并且多个子识别结果之间存有矛盾的情况,比如,获取到的详细程度最高的子识别结果包括“男生”和“女生”,子识别结果“男生”和“女生”对应的属性类型均为“性别”,但同一属性类型下只能输出一个识别结果。此时,可以从详细程度最高的子识别结果中选择置信度最高的子识别结果作为该待测目标的识别结果,比如,子识别结果“男生”的置信度为70%,而子识别结果“女生”的置信度为90%(>70%),从而可以选择子识别结果“女生”作为该待测目标的识别 结果。In addition, in practical applications, there may be cases where the obtained sub-recognition result with the highest degree of detail is included, and there are contradictions between the plurality of sub-recognition results, for example, the sub-recognition result with the highest level of detail obtained. Including "boys" and "girls", the sub-recognition results "boys" and "girls" correspond to the attribute types are "gender", but only one recognition result can be output under the same attribute type. At this time, the sub-recognition result with the highest degree of confidence can be selected from the sub-identification result with the highest degree of detail as the recognition result of the object to be tested, for example, the confidence of the sub-recognition result “boy” is 70%, and the sub-recognition result “ The confidence of the girl is 90% (>70%), so that the sub-recognition result “girl” can be selected as the target of the object to be tested. result.
在该实施方式中,通过融合至少两种信息源得到的子识别结果来生成待测目标的识别结果,能够进一步提升目标识别的详细程度。In this embodiment, by generating the sub-recognition result obtained by the at least two information sources to generate the recognition result of the object to be tested, the level of detail of the target recognition can be further improved.
在第二种实施方式中,可以采用“分级决策融合”的方式识别待测目标,即:分别基于所述至少两种信息源,根据优先级从高到低的顺序逐级获取所述待测目标每一属性类型对应的子判断结果以及每一子判断结果的子置信度,直至第一个子置信度满足预设条件的子判断结果出现时,输出所述第一个子置信度满足预设条件的子判断结果作为所述待测目标的识别结果。In the second embodiment, the target to be tested may be identified by means of a hierarchical decision fusion, that is, the test is obtained step by step according to the priority from the highest to the lowest based on the at least two information sources. The sub-judgment result corresponding to each attribute type of the target and the sub-confidence of each sub-judgment result, until the sub-judgment result in which the first sub-confidence satisfies the preset condition occurs, the output of the first sub-confidence satisfies the pre- The sub-judgment result of the condition is used as the recognition result of the object to be tested.
其中,所述“子判断结果”是指仅基于一种信息源分析得到的针对该待测目标在某一属性类型下的判断结果,每一信息源在每一属性类型下对应一个子判断结果。所述“子置信度”是指该子判断结果的可信程度,用于表征该子判断结果的可靠性。The “sub-judgment result” refers to a judgment result of the object to be tested under a certain attribute type based on only one information source analysis, and each information source corresponds to one sub-judgment result under each attribute type. . The "sub-confidence" refers to the degree of credibility of the sub-judgment result and is used to characterize the reliability of the sub-judgment result.
举例说明:假设待测目标为人,属性类型包括:“人名”、“性别”和“是否为人”,且,优先级关系为:L1(人名)>L2(性别)>L3(是否为人),采集到的信息包括图像信息和声音信息,则,首先分别基于该图像信息和声音信息获取“人名”对应的子判断结果及其置信度,假设基于图像信息获取到的子判断结果为“李四”并且“李四”的子置信度满足第一预设条件,而基于声音信息获取到的子判断结果为“张三”,但“张三”的子置信度不满足第二预设条件,说明基于声音信息无法识别出“人名”,此时,子判断结果“李四”是第一个子置信度满足预设条件(满足第一预设条件或第二预设条件)的子判断结果,从而,可以输出“李四”作为待测目标的识别结果。For example: Assume that the target to be tested is a person, and the attribute types include: “person name”, “gender” and “whether it is a person”, and the priority relationship is: L1 (person name)>L2 (gender)>L3 (whether it is a person), collection The obtained information includes image information and sound information. First, the sub-judgment result corresponding to the “person name” and the confidence thereof are first obtained based on the image information and the sound information, respectively, and it is assumed that the sub-judgment result obtained based on the image information is “Li Si”. And the sub-confidence of "Li Si" satisfies the first preset condition, and the sub-judgment result obtained based on the sound information is "Zhang San", but the sub-confidence of "Zhang San" does not satisfy the second preset condition, The "person name" cannot be recognized based on the sound information. At this time, the sub-judgment result "Li Si" is the sub-judgment result that the first sub-confidence satisfies the preset condition (meeting the first preset condition or the second preset condition). Thereby, "Li Si" can be output as the recognition result of the object to be tested.
此外,在一些实施例中,当同一时刻出现多个置信度满足预设条件,并且,不相同的子判断结果时,即,所述“第一个子置信度满足预设条件的子判断结果”包括多个时,可以选择这些子判断结果中置信度最高的子判断结果作为待测目标的识别结果。In addition, in some embodiments, when a plurality of confidence levels satisfy the preset condition at the same time, and different sub-judgment results, that is, the sub-judgment result that the first sub-confidence satisfies the preset condition When multiples are included, the sub-judgment result with the highest confidence among the sub-judgment results may be selected as the recognition result of the target to be tested.
在该实施方式中,通过分别基于至少两种信息源分级识别待测目标,只要其中一种信息源分析得到最优的识别结果(即,可靠且详细程度最高的识别结 果),就可以直接输出该最优的识别结果,能够提升目标识别的效率。In this embodiment, the target to be tested is hierarchically identified based on at least two information sources, respectively, as long as one of the information sources is analyzed to obtain an optimal recognition result (ie, the most reliable and detailed identification node). Therefore, the optimal recognition result can be directly output, and the efficiency of target recognition can be improved.
在第三种实施方式中,可以采用“分级融合决策”的方式识别待测目标,即:根据优先级由高到低的顺序,逐级从所述至少两种信息源中提取出所述待测目标每一属性类型对应的特征,并根据每一属性类型对应的特征获取每一属性类型对应的判断结果以及所述判断结果的置信度;直至第一个置信度满足预设条件的判断结果出现时,输出这个第一个置信度满足预设条件的判断结果作为所述待测目标的识别结果。In a third implementation manner, the target to be tested may be identified by using a “hierarchical fusion decision”, that is, the to-be-selected from the at least two information sources is sequentially stepped according to the order of priority from high to low. Measure the feature corresponding to each attribute type of the target, and obtain the judgment result corresponding to each attribute type and the confidence of the judgment result according to the feature corresponding to each attribute type; until the first confidence level satisfies the determination result of the preset condition When it appears, the judgment result that the first confidence level satisfies the preset condition is output as the recognition result of the object to be tested.
举例说明:假设待测目标为人,属性类型包括:“人名”、“性别”和“是否为人”,且,优先级关系为:L1(人名)>L2(性别)>L3(是否为人),采集到的信息包括图像信息和声音信息,则,可以首先从采集到的图像信息中提取出用来识别“人名”的一级特征A1,以及,从声音信息中提取出用来识别“人名”的一级特征A2,然后将这两类一级特征A1和A2融合在一起(比如,通过融合两类特征的神经网络分离器将A1和A2拼接在一起)生成特征A,然后根据特征A识别“人名”的一级判断结果以及该一级判断结果的一级置信度,如果该一级置信度满足一级预设条件,则输出该一级判断结果;否则,从采集到的图像信息中提取出用来识别“性别”的二级特征B1,以及,从声音信息中提取出用来识别“性别”的二级特征B2,然后将这两类二级特征B1和B2融合在一起生成特征B,然后根据特征B识别“性别”的二级判断结果以及该二级判断结果的二级置信度,如果该二级置信度满足二级预设条件,则输出该二级判断结果;否则,继续获取再下一级别的属性类型对应的判断结果及其置信度,如此循环,直至获取到置信度满足预设条件的判断结果。For example: Assume that the target to be tested is a person, and the attribute types include: “person name”, “gender” and “whether it is a person”, and the priority relationship is: L1 (person name)>L2 (gender)>L3 (whether it is a person), collection The obtained information includes image information and sound information, and then, the first-level feature A1 for identifying the "person name" may be first extracted from the collected image information, and the "person name" is extracted from the sound information. The first-level feature A2, and then the two types of first-level features A1 and A2 are merged together (for example, A1 and A2 are spliced together by a neural network separator that combines two types of features) to generate feature A, and then identify according to feature A. The first-level judgment result of the person name and the first-level confidence of the first-level judgment result, if the first-level confidence level satisfies the first-level preset condition, the first-level judgment result is output; otherwise, the collected image information is extracted A secondary feature B1 for identifying "gender", and a secondary feature B2 for identifying "gender" are extracted from the sound information, and then the two types of secondary features B1 and B2 are fused together to generate feature B. And then, according to feature B, the second-level judgment result of “gender” and the second-level confidence of the second-level judgment result are recognized, and if the second-level confidence level satisfies the second-level preset condition, the second-level judgment result is output; otherwise, continue The judgment result corresponding to the attribute type of the next level and the confidence thereof are obtained, and the loop is obtained until the judgment result that the confidence degree satisfies the preset condition is obtained.
在该实施方式中,通过逐级融合采集到的至少两种信息源的特征,能够丰富目标识别的判定信息,不仅能够提升目标识别的详细程度,还可以提升目标识别的效率。In this embodiment, by hierarchically merging the collected features of at least two information sources, the determination information of the target recognition can be enriched, which not only can improve the detail level of the target recognition, but also improve the efficiency of the target recognition.
通过上述技术方案可知,本申请实施例的有益效果在于:本申请实施例提供的目标识别方法通过采集至少两种信息源,并根据该至少两种信息源输出待测目标的识别结果,能够提升目标识别的详细程度和效率。 According to the foregoing technical solution, the object recognition method provided by the embodiment of the present application can improve the target identification method provided by the embodiment of the present application by collecting at least two types of information sources and outputting the recognition result of the object to be tested according to the at least two information sources. The level of detail and efficiency of target recognition.
实施例三 Embodiment 3
图3是本申请实施例提供的一种目标识别装置的结构示意图,请参阅图3,该目标识别装置3包括:FIG. 3 is a schematic structural diagram of a target recognition apparatus according to an embodiment of the present application. Referring to FIG. 3, the target identification apparatus 3 includes:
信息采集单元31,用于采集针对待测目标的信息,所述待测目标包括至少两种属性类型,所述至少两种属性类型之间设置有优先级关系;以及,The information collecting unit 31 is configured to collect information about a target to be tested, where the object to be tested includes at least two types of attributes, and a priority relationship is set between the at least two types of attributes;
识别单元32,用于基于所述信息输出所述待测目标的识别结果,所述识别结果为其中一种所述属性类型对应的判断结果,所述判断结果的置信度满足预设条件,并且,所述识别结果对应的属性类型在置信度满足所述预设条件的判断结果对应的属性类型中优先级最高。The identification unit 32 is configured to output a recognition result of the object to be tested based on the information, where the recognition result is a determination result corresponding to one of the attribute types, the confidence of the determination result satisfies a preset condition, and The attribute type corresponding to the recognition result has the highest priority among the attribute types corresponding to the determination result that the confidence degree satisfies the preset condition.
在本实施例中,当信息采集单元31采集到针对待测目标的信息时,通过识别单元32基于所述信息输出所述待测目标的识别结果。其中,所述待测目标包括至少两种属性类型,所述至少两种属性类型之间设置有优先级关系;所述识别结果为其中一种所述属性类型对应的判断结果,所述判断结果的置信度满足预设条件,并且,所述识别结果对应的属性类型在置信度满足所述预设条件的判断结果对应的属性类型中优先级最高。In the present embodiment, when the information collecting unit 31 collects information for the object to be tested, the recognition unit 32 outputs the recognition result of the object to be tested based on the information. The object to be tested includes at least two attribute types, and a priority relationship is set between the at least two attribute types; the recognition result is a determination result corresponding to one of the attribute types, and the determination result is The confidence level satisfies the preset condition, and the attribute type corresponding to the recognition result has the highest priority among the attribute types corresponding to the determination result that the confidence degree satisfies the preset condition.
其中,在一些实施例中,识别单元32具体用于:基于所述信息获取所述待测目标每一属性类型对应的判断结果以及每一判断结果的置信度;输出置信度满足预设条件的判断结果中优先级最高的属性类型对应的判断结果作为所述待测目标的识别结果。In some embodiments, the identifying unit 32 is specifically configured to: obtain, according to the information, a determination result corresponding to each attribute type of the object to be tested and a confidence level of each determination result; and output confidence that the preset condition meets the preset condition The judgment result corresponding to the attribute type having the highest priority among the judgment results is used as the recognition result of the object to be tested.
其中,在一些实施例中,识别单元32包括分析模块321和输出模块322;其中,分析模块321用于基于采集到的针对待测目标的信息,根据优先级从高到低的顺序逐级获取所述待测目标每一属性类型对应的判断结果以及每一判断结果的置信度;输出模块322用于当第一个置信度满足预设条件的判断结果出现时,输出所述第一个置信度满足预设条件的判断结果作为所述待测目标的识别结果。进一步地,在另一些实施例中,若信息采集单元31采集到的信息包括至少两种信息源,此时,分析模块321具体用于:根据优先级由高到低的顺序, 逐级从所述至少两种信息源中提取出所述待测目标每一属性类型对应的特征,并根据每一属性类型对应的特征获取每一属性类型对应的判断结果以及所述判断结果的置信度。In some embodiments, the identification unit 32 includes an analysis module 321 and an output module 322. The analysis module 321 is configured to acquire the information according to the target to be tested according to the priority from high to low. a determination result corresponding to each attribute type of the object to be tested and a confidence level of each determination result; the output module 322 is configured to output the first confidence when the first confidence level meets the preset condition The judgment result that satisfies the preset condition is used as the recognition result of the object to be tested. Further, in other embodiments, if the information collected by the information collection unit 31 includes at least two types of information sources, the analysis module 321 is specifically configured to: according to the order of priority from high to low, And extracting, from the at least two information sources, the features corresponding to each attribute type of the object to be tested, and obtaining the determination result corresponding to each attribute type and the determination result according to the feature corresponding to each attribute type Confidence.
此外,在又一些实施例中,当信息采集单元31采集到的信息包括至少两种信息源时,识别单元32具体用于:分别基于所述至少两种信息源,根据优先级从高到低的顺序逐级获取所述待测目标每一属性类型对应的子判断结果以及每一子判断结果的子置信度,直至第一个子置信度满足预设条件的子判断结果出现时,输出所述第一个子置信度满足预设条件的子判断结果作为所述待测目标的识别结果;或者,分别基于所述至少两种信息源获取所述待测目标的子识别结果,其中,每一信息源对应一个子识别结果;根据所述子识别结果输出所述待测目标的识别结果。In addition, in some embodiments, when the information collected by the information collecting unit 31 includes at least two types of information sources, the identifying unit 32 is specifically configured to: based on the at least two information sources respectively, according to the priority from high to low. The sub-judging result corresponding to each attribute type of the object to be tested and the sub-confidence of each sub-judgment result are obtained step by step until the sub-judgment result of the first sub-confidence satisfying the preset condition appears, and the output is a sub-judgment result that satisfies a preset condition as a recognition result of the object to be tested; or, a sub-recognition result of the object to be tested is acquired based on the at least two information sources, wherein each An information source corresponds to a sub-recognition result; and the recognition result of the object to be tested is output according to the sub-recognition result.
再者,在一些实施例中,该目标识别装置3还包括:Moreover, in some embodiments, the target recognition device 3 further includes:
交互单元33,用于发送与所述识别结果对应的交互信号。The interaction unit 33 is configured to send an interaction signal corresponding to the recognition result.
需要说明的是,由于所述目标识别装置与上述方法实施例中的目标识别方法基于相同的发明构思,因此,上述方法实施例的相应内容以及有益效果同样适用于本装置实施例,此处不再详述。It should be noted that, since the target recognition device and the target recognition method in the foregoing method embodiments are based on the same inventive concept, the corresponding content and the beneficial effects of the foregoing method embodiments are also applicable to the device embodiment, and More details.
通过上述技术方案可知,本申请实施例的有益效果在于:本申请实施例提供的目标识别装置通过根据对待测目标的描述的详细程度为待测目标的属性划分多个具有优先级顺序的属性类型,并且在识别的过程中,由识别单元获取每一属性类型下的判断结果的置信度,继而根据实际识别情况输出置信度满足预设条件的判断结果中优先级最高的属性类型对应的判断结果作为所述待测目标的识别结果,能够在不同的识别场景下,确保输出的识别结果的可靠性,同时,尽可能地输出更详细的识别结果,即,使得最终得到的识别结果能够在可靠性和详细程度之间达到折中,从而提升用户体验。According to the foregoing technical solution, the object recognition apparatus provided by the embodiment of the present application divides the attribute type of the priority order into the attribute of the object to be tested according to the detailed degree of the description of the object to be tested. And in the process of identification, the confidence unit obtains the confidence of the judgment result under each attribute type, and then outputs the judgment result corresponding to the attribute type with the highest priority among the judgment results satisfying the preset condition according to the actual recognition situation. As the recognition result of the object to be tested, it is possible to ensure the reliability of the output recognition result under different recognition scenarios, and at the same time, output a more detailed recognition result as much as possible, that is, the final recognition result can be reliably A compromise between sex and level of detail improves the user experience.
实施例四Embodiment 4
图4是本申请实施例提供的一种智能终端的硬件结构示意图,该智能终端 400可以是任意类型的智能终端,如:机器人、导盲眼镜、智能头盔、智能手机、平板电脑、服务器等,能够执行上述方法实施例一和/或实施例二所提供的目标识别方法。4 is a schematic structural diagram of hardware of an intelligent terminal according to an embodiment of the present application, where the smart terminal The 400 can be any type of smart terminal, such as a robot, a blind eyeglass, a smart helmet, a smart phone, a tablet, a server, etc., and can perform the target recognition method provided by the first embodiment and/or the second embodiment.
具体地,请参阅图4,该智能终端400包括:Specifically, referring to FIG. 4, the smart terminal 400 includes:
一个或多个处理器401以及存储器402,图4中以一个处理器401为例。One or more processors 401 and memory 402 are exemplified by one processor 401 in FIG.
处理器401和存储器402可以通过总线或者其他方式连接,图4中以通过总线连接为例。The processor 401 and the memory 402 can be connected by a bus or other means, and the connection by a bus is taken as an example in FIG.
存储器402作为一种非暂态计算机可读存储介质,可用于存储非暂态软件程序、非暂态性计算机可执行程序以及模块,如本申请实施例中的目标识别方法对应的程序指令/模块(例如,附图3所示的信息采集单元31、识别单元32和交互单元33)。处理器401通过运行存储在存储器402中的非暂态软件程序、指令以及模块,从而执行目标识别装置的各种功能应用以及数据处理,即实现上述任一方法实施例的目标识别方法。The memory 402 is used as a non-transitory computer readable storage medium, and can be used for storing a non-transitory software program, a non-transitory computer executable program, and a module, such as a program instruction/module corresponding to the target recognition method in the embodiment of the present application. (For example, the information collecting unit 31, the identifying unit 32, and the interactive unit 33 shown in Fig. 3). The processor 401 executes various functional applications and data processing of the target recognition device by executing non-transitory software programs, instructions, and modules stored in the memory 402, that is, implementing the target recognition method of any of the above method embodiments.
存储器402可以包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需要的应用程序;存储数据区可存储根据目标识别装置的使用所创建的数据等。此外,存储器402可以包括高速随机存取存储器,还可以包括非暂态存储器,例如至少一个磁盘存储器件、闪存器件、或其他非暂态固态存储器件。在一些实施例中,存储器402可选包括相对于处理器401远程设置的存储器,这些远程存储器可以通过网络连接至智能终端400。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 402 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created according to usage of the target identification device, and the like. Moreover, memory 402 can include high speed random access memory, and can also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid state storage device. In some embodiments, memory 402 can optionally include memory remotely located relative to processor 401, which can be connected to smart terminal 400 over a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
所述一个或者多个模块存储在所述存储器402中,当被所述一个或者多个处理器401执行时,执行上述任意方法实施例中的目标识别方法,例如,执行以上描述的图1中的方法步骤110至步骤120,图2中的方法步骤210至步骤220,实现图3中的单元31-33的功能。The one or more modules are stored in the memory 402, and when executed by the one or more processors 401, perform a target recognition method in any of the above method embodiments, for example, performing the above described FIG. Method step 110 to step 120, method step 210 to step 220 in FIG. 2, implement the functions of units 31-33 in FIG.
实施例五Embodiment 5
本申请实施例还提供了一种非暂态计算机可读存储介质,所述非暂态计算 机可读存储介质存储有计算机可执行指令,该计算机可执行指令被一个或多个处理器执行,例如被图4中的一个处理器401执行,可使得上述一个或多个处理器执行上述任意方法实施例中的目标识别方法,例如,执行以上描述的图1中的方法步骤110至步骤120,图2中的方法步骤210至步骤220,实现图3中的单元31-33的功能。The embodiment of the present application further provides a non-transitory computer readable storage medium, the non-transient computing The machine readable storage medium stores computer executable instructions that are executed by one or more processors, such as by one of the processors 401 of FIG. 4, such that the one or more processors perform any of the above The object recognition method in the method embodiment, for example, performs the method steps 110 to 120 in FIG. 1 described above, and the method steps 210 to 220 in FIG. 2 implement the functions of the units 31-33 in FIG.
以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。The device embodiments described above are merely illustrative, wherein the units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, ie may be located A place, or it can be distributed to multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
通过以上的实施方式的描述,本领域普通技术人员可以清楚地了解到各实施方式可借助软件加通用硬件平台的方式来实现,当然也可以通过硬件。本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程是可以通过计算机程序来指令相关的硬件来完成,所述的程序可存储于一非暂态计算机可读取存储介质中,该程序在执行时,可包括如上述各方法的实施例的流程。其中,所述的存储介质可为磁碟、光盘、只读存储记忆体(Read-Only Memory,ROM)或随机存储记忆体(Random Access Memory,RAM)等。Through the description of the above embodiments, those skilled in the art can clearly understand that the various embodiments can be implemented by means of software plus a general hardware platform, and of course, by hardware. A person skilled in the art can understand that all or part of the process of implementing the above embodiments can be completed by a computer program to instruct related hardware, and the program can be stored in a non-transitory computer readable storage medium. The program, when executed, may include the flow of an embodiment of the methods as described above. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).
上述产品可执行本申请实施例所提供的目标识别方法,具备执行目标识别方法相应的功能模块和有益效果。未在本实施例中详尽描述的技术细节,可参见本申请实施例所提供的目标识别方法。The above product can perform the target recognition method provided by the embodiment of the present application, and has the corresponding functional modules and beneficial effects of performing the target recognition method. For the technical details that are not described in detail in this embodiment, refer to the object recognition method provided by the embodiment of the present application.
最后应说明的是:以上实施例仅用以说明本申请的技术方案,而非对其限制;在本申请的思路下,以上实施例或者不同实施例中的技术特征之间也可以进行组合,步骤可以以任意顺序实现,并存在如上所述的本申请的不同方面的许多其它变化,为了简明,它们没有在细节中提供;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的范围。 Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present application, and are not limited thereto; in the idea of the present application, the technical features in the above embodiments or different embodiments may also be combined. The steps may be carried out in any order, and there are many other variations of the various aspects of the present application as described above, which are not provided in the details for the sake of brevity; although the present application has been described in detail with reference to the foregoing embodiments, The skilled person should understand that the technical solutions described in the foregoing embodiments may be modified, or some of the technical features may be equivalently replaced; and the modifications or substitutions do not deviate from the embodiments of the present application. The scope of the technical solution.

Claims (17)

  1. 一种目标识别方法,应用于智能终端,其特征在于,包括:A target recognition method is applied to an intelligent terminal, and is characterized in that:
    采集针对待测目标的信息,所述待测目标包括至少两种属性类型,所述至少两种属性类型之间设置有优先级关系;Collecting information for the object to be tested, the object to be tested includes at least two types of attributes, and a priority relationship is set between the at least two types of attributes;
    基于所述信息输出所述待测目标的识别结果,所述识别结果为其中一种所述属性类型对应的判断结果,所述判断结果的置信度满足预设条件,并且,所述识别结果对应的属性类型在置信度满足所述预设条件的判断结果对应的属性类型中优先级最高。And outputting, according to the information, the recognition result of the object to be tested, the recognition result is a determination result corresponding to one of the attribute types, the confidence of the determination result satisfies a preset condition, and the recognition result corresponds to The attribute type has the highest priority among the attribute types corresponding to the determination result that the confidence degree satisfies the preset condition.
  2. 根据权利要求1所述的目标识别方法,其特征在于,所述基于所述信息输出所述待测目标的识别结果,包括:The object recognition method according to claim 1, wherein the outputting the recognition result of the object to be tested based on the information comprises:
    基于所述信息获取所述待测目标每一属性类型对应的判断结果以及每一判断结果的置信度;Obtaining, according to the information, a determination result corresponding to each attribute type of the object to be tested and a confidence level of each determination result;
    输出置信度满足预设条件的判断结果中优先级最高的属性类型对应的判断结果作为所述待测目标的识别结果。The determination result corresponding to the attribute type having the highest priority among the determination results satisfying the preset condition is used as the recognition result of the object to be tested.
  3. 根据权利要求1所述的目标识别方法,其特征在于,所述基于所述信息输出所述待测目标的识别结果,包括:The object recognition method according to claim 1, wherein the outputting the recognition result of the object to be tested based on the information comprises:
    基于所述信息,根据优先级从高到低的顺序逐级获取所述待测目标每一属性类型对应的判断结果以及每一判断结果的置信度,直至第一个置信度满足预设条件的判断结果出现时,输出所述第一个置信度满足预设条件的判断结果作为所述待测目标的识别结果。Determining, according to the information, the determination result corresponding to each attribute type of the object to be tested and the confidence of each determination result according to the priority from high to low, until the first confidence meets the preset condition When the judgment result appears, the determination result that the first confidence level satisfies the preset condition is output as the recognition result of the object to be tested.
  4. 根据权利要求3所述的目标识别方法,其特征在于,所述信息包括至少两种信息源,则,所述基于所述信息,根据优先级从高到低的顺序逐级获取所述待测目标每一属性类型对应的判断结果以及每一判断结果的置信度,包括:The object recognition method according to claim 3, wherein the information includes at least two types of information sources, and the obtaining, according to the information, the step-by-step acquisition according to the priority from high to low The judgment result corresponding to each attribute type of the target and the confidence of each judgment result include:
    根据优先级由高到低的顺序,逐级从所述至少两种信息源中提取出所述待测目标每一属性类型对应的特征,并根据每一属性类型对应的特征获取每一属 性类型对应的判断结果以及所述判断结果的置信度。Extracting, according to the order of priority from high to low, the features corresponding to each attribute type of the object to be tested are extracted step by step, and acquiring each genus according to the feature corresponding to each attribute type The judgment result corresponding to the sex type and the confidence of the judgment result.
  5. 根据权利要求1所述的目标识别方法,其特征在于,所述信息包括至少两种信息源,则,所述基于所述信息输出所述待测目标的识别结果,包括:The object recognition method according to claim 1, wherein the information includes at least two types of information sources, and the outputting the recognition result of the object to be tested based on the information includes:
    分别基于所述至少两种信息源,根据优先级从高到低的顺序逐级获取所述待测目标每一属性类型对应的子判断结果以及每一子判断结果的子置信度,直至第一个子置信度满足预设条件的子判断结果出现时,输出所述第一个子置信度满足预设条件的子判断结果作为所述待测目标的识别结果。Obtaining, according to the at least two information sources, the sub-judgment result corresponding to each attribute type of the object to be tested and the sub-confidence of each sub-judgment result, according to the order of priority from high to low, respectively, until the first When the sub-judgment result whose sub-confidence satisfies the preset condition appears, the sub-judgment result whose first sub-confidence satisfies the preset condition is output as the recognition result of the object to be tested.
  6. 根据权利要求1所述的目标识别方法,其特征在于,所述信息包括至少两种信息源,则,所述基于所述信息输出所述待测目标的识别结果,包括:The object recognition method according to claim 1, wherein the information includes at least two types of information sources, and the outputting the recognition result of the object to be tested based on the information includes:
    分别基于所述至少两种信息源获取所述待测目标的子识别结果,其中,每一信息源对应一个子识别结果;Obtaining a sub-recognition result of the object to be tested, respectively, based on the at least two information sources, where each information source corresponds to one sub-recognition result;
    根据所述子识别结果输出所述待测目标的识别结果。And outputting the recognition result of the object to be tested according to the sub-identification result.
  7. 根据权利要求1-6任一项所述的目标识别方法,其特征在于,所述方法还包括:The object recognition method according to any one of claims 1 to 6, wherein the method further comprises:
    发送与所述识别结果对应的交互信号。An interaction signal corresponding to the recognition result is transmitted.
  8. 一种目标识别装置,应用于智能终端,其特征在于,包括:A target recognition device is applied to a smart terminal, and includes:
    信息采集单元,用于采集针对待测目标的信息,所述待测目标包括至少两种属性类型,所述至少两种属性类型之间设置有优先级关系;An information collecting unit, configured to collect information about a target to be tested, where the object to be tested includes at least two types of attributes, and a priority relationship is set between the at least two types of attributes;
    识别单元,用于基于所述信息输出所述待测目标的识别结果,所述识别结果为其中一种所述属性类型对应的判断结果,所述判断结果的置信度满足预设条件,并且,所述识别结果对应的属性类型在置信度满足所述预设条件的判断结果对应的属性类型中优先级最高。a recognition unit, configured to output a recognition result of the object to be tested based on the information, where the recognition result is a determination result corresponding to one of the attribute types, the confidence of the determination result satisfies a preset condition, and The attribute type corresponding to the recognition result has the highest priority among the attribute types corresponding to the determination result that the confidence degree satisfies the preset condition.
  9. 根据权利要求8所述的目标识别装置,其特征在于,所述识别单元具体用于:The object recognition device according to claim 8, wherein the identification unit is specifically configured to:
    基于所述信息获取所述待测目标每一属性类型对应的判断结果以及每一判 断结果的置信度;Obtaining a judgment result corresponding to each attribute type of the object to be tested and each sentence based on the information The confidence of the result of the break;
    输出置信度满足预设条件的判断结果中优先级最高的属性类型对应的判断结果作为所述待测目标的识别结果。The determination result corresponding to the attribute type having the highest priority among the determination results satisfying the preset condition is used as the recognition result of the object to be tested.
  10. 根据权利要求8所述的目标识别装置,其特征在于,所述识别单元包括:The object recognition device according to claim 8, wherein the identification unit comprises:
    分析模块,用于基于所述信息,根据优先级从高到低的顺序逐级获取所述待测目标每一属性类型对应的判断结果以及每一判断结果的置信度;An analysis module, configured to obtain, according to the information, a judgment result corresponding to each attribute type of the object to be tested and a confidence level of each determination result according to a priority from high to low;
    输出模块,用于当第一个置信度满足预设条件的判断结果出现时,输出所述第一个置信度满足预设条件的判断结果作为所述待测目标的识别结果。And an output module, configured to output, when the first confidence level meets the preset condition, the determination result that the first confidence meets the preset condition is used as the recognition result of the object to be tested.
  11. 根据权利要求10所述的目标识别装置,其特征在于,所述信息包括至少两种信息源,则,所述分析模块具体用于:The target recognition device according to claim 10, wherein the information comprises at least two types of information sources, and the analysis module is specifically configured to:
    根据优先级由高到低的顺序,逐级从所述至少两种信息源中提取出所述待测目标每一属性类型对应的特征,并根据每一属性类型对应的特征获取每一属性类型对应的判断结果以及所述判断结果的置信度。Extracting, according to the order of priority from high to low, the features corresponding to each attribute type of the object to be tested are obtained step by step, and acquiring each attribute type according to the feature corresponding to each attribute type Corresponding judgment result and confidence of the judgment result.
  12. 根据权利要求8所述的目标识别装置,其特征在于,所述信息包括至少两种信息源,则,所述识别单元具体用于:The object recognition apparatus according to claim 8, wherein the information includes at least two types of information sources, and the identification unit is specifically configured to:
    分别基于所述至少两种信息源,根据优先级从高到低的顺序逐级获取所述待测目标每一属性类型对应的子判断结果以及每一子判断结果的子置信度,直至第一个子置信度满足预设条件的子判断结果出现时,输出所述第一个子置信度满足预设条件的子判断结果作为所述待测目标的识别结果。Obtaining, according to the at least two information sources, the sub-judgment result corresponding to each attribute type of the object to be tested and the sub-confidence of each sub-judgment result, according to the order of priority from high to low, respectively, until the first When the sub-judgment result whose sub-confidence satisfies the preset condition appears, the sub-judgment result whose first sub-confidence satisfies the preset condition is output as the recognition result of the object to be tested.
  13. 根据权利要求8所述的目标识别装置,其特征在于,所述信息包括至少两种信息源,则,所述识别单元具体用于:The object recognition apparatus according to claim 8, wherein the information includes at least two types of information sources, and the identification unit is specifically configured to:
    分别基于所述至少两种信息源获取所述待测目标的子识别结果,其中,每一信息源对应一个子识别结果;Obtaining a sub-recognition result of the object to be tested, respectively, based on the at least two information sources, where each information source corresponds to one sub-recognition result;
    根据所述子识别结果输出所述待测目标的识别结果。 And outputting the recognition result of the object to be tested according to the sub-identification result.
  14. 根据权利要求8-13任一项所述的目标识别装置,其特征在于,所述目标识别装置还包括:The object recognition device according to any one of claims 8 to 13, wherein the object recognition device further comprises:
    交互单元,用于发送与所述识别结果对应的交互信号。And an interaction unit, configured to send an interaction signal corresponding to the recognition result.
  15. 一种智能终端,其特征在于,包括:An intelligent terminal, comprising:
    至少一个处理器;以及,At least one processor; and,
    与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein
    所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行如权利要求1-7任一项所述的目标识别方法。The memory stores instructions executable by the at least one processor, the instructions being executed by the at least one processor to enable the at least one processor to perform the method of any of claims 1-7 Target identification method.
  16. 一种非暂态计算机可读存储介质,其特征在于,所述非暂态计算机可读存储介质存储有计算机可执行指令,所述计算机可执行指令用于使智能终端执行如权利要求1-7任一项所述的目标识别方法。A non-transitory computer readable storage medium, characterized in that the non-transitory computer readable storage medium stores computer executable instructions for causing a smart terminal to perform the claims 1-7 The target recognition method described in any one of the above.
  17. 一种计算机程序产品,其特征在于,所述计算机程序产品包括存储在非暂态计算机可读存储介质上的计算机程序,所述计算机程序包括程序指令,当所述程序指令被智能终端执行时,使所述智能终端执行如权利要求1-7任一项所述的目标识别方法。 A computer program product, comprising: a computer program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions, when the program instructions are executed by a smart terminal, The smart terminal is caused to perform the object recognition method according to any one of claims 1-7.
PCT/CN2017/101967 2017-09-15 2017-09-15 Target recognition method and apparatus, and intelligent terminal WO2019051814A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201780002292.4A CN107995982B (en) 2017-09-15 2017-09-15 A kind of target identification method, device and intelligent terminal
PCT/CN2017/101967 WO2019051814A1 (en) 2017-09-15 2017-09-15 Target recognition method and apparatus, and intelligent terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/101967 WO2019051814A1 (en) 2017-09-15 2017-09-15 Target recognition method and apparatus, and intelligent terminal

Publications (1)

Publication Number Publication Date
WO2019051814A1 true WO2019051814A1 (en) 2019-03-21

Family

ID=62039956

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/101967 WO2019051814A1 (en) 2017-09-15 2017-09-15 Target recognition method and apparatus, and intelligent terminal

Country Status (2)

Country Link
CN (1) CN107995982B (en)
WO (1) WO2019051814A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110866466A (en) * 2019-10-30 2020-03-06 平安科技(深圳)有限公司 Face recognition method, face recognition device, storage medium and server
CN111833298A (en) * 2020-06-04 2020-10-27 石家庄喜高科技有限责任公司 Skeletal development grade detection method and terminal equipment
CN112118278A (en) * 2019-06-04 2020-12-22 杭州海康威视系统技术有限公司 Computing node access method and device, electronic equipment and machine-readable storage medium
CN113704266A (en) * 2021-08-31 2021-11-26 深信服科技股份有限公司 Asset information processing method and device, electronic equipment and storage medium

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110894658B (en) * 2018-08-23 2023-03-21 无锡小天鹅电器有限公司 Washing control method and device and clothes treatment device
CN112071009B (en) * 2019-06-11 2023-01-03 中国石油化工股份有限公司 Optical fiber pipeline early warning system and method thereof
CN112434795A (en) * 2020-12-01 2021-03-02 中国科学院空天信息创新研究院 Novel three-dimensional artificial neuron device and artificial neural network recognition system
CN112562221A (en) * 2020-12-02 2021-03-26 支付宝(杭州)信息技术有限公司 Terminal and method for supporting face recognition
CN114627098A (en) * 2022-03-28 2022-06-14 上海商汤智能科技有限公司 Detection result processing method and device, equipment and medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070016581A1 (en) * 2005-07-13 2007-01-18 Fujitsu Limited Category setting support method and apparatus
CN101452529A (en) * 2007-12-07 2009-06-10 索尼株式会社 Information processing apparatus and information processing method, and computer program
CN105844283A (en) * 2015-01-16 2016-08-10 阿里巴巴集团控股有限公司 Method for identifying category of image, image search method and image search device
CN106021060A (en) * 2016-04-29 2016-10-12 乐视控股(北京)有限公司 Method and device for determining priority of user behavior message
CN106326928A (en) * 2016-08-24 2017-01-11 四川九洲电器集团有限责任公司 Target identification method and equipment
CN106934817A (en) * 2017-02-23 2017-07-07 中国科学院自动化研究所 Based on multiattribute multi-object tracking method and device

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104143079B (en) * 2013-05-10 2016-08-17 腾讯科技(深圳)有限公司 The method and system of face character identification
CN103914683A (en) * 2013-12-31 2014-07-09 闻泰通讯股份有限公司 Gender identification method and system based on face image
CN103824054B (en) * 2014-02-17 2018-08-07 北京旷视科技有限公司 A kind of face character recognition methods based on cascade deep neural network
US9319416B2 (en) * 2014-07-03 2016-04-19 Alcatel Lucent Priority based radius authentication
CN107145857B (en) * 2017-04-29 2021-05-04 深圳市深网视界科技有限公司 Face attribute recognition method and device and model establishment method

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070016581A1 (en) * 2005-07-13 2007-01-18 Fujitsu Limited Category setting support method and apparatus
CN101452529A (en) * 2007-12-07 2009-06-10 索尼株式会社 Information processing apparatus and information processing method, and computer program
CN105844283A (en) * 2015-01-16 2016-08-10 阿里巴巴集团控股有限公司 Method for identifying category of image, image search method and image search device
CN106021060A (en) * 2016-04-29 2016-10-12 乐视控股(北京)有限公司 Method and device for determining priority of user behavior message
CN106326928A (en) * 2016-08-24 2017-01-11 四川九洲电器集团有限责任公司 Target identification method and equipment
CN106934817A (en) * 2017-02-23 2017-07-07 中国科学院自动化研究所 Based on multiattribute multi-object tracking method and device

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112118278A (en) * 2019-06-04 2020-12-22 杭州海康威视系统技术有限公司 Computing node access method and device, electronic equipment and machine-readable storage medium
CN112118278B (en) * 2019-06-04 2023-07-04 杭州海康威视系统技术有限公司 Computing node access method, device, electronic equipment and machine-readable storage medium
CN110866466A (en) * 2019-10-30 2020-03-06 平安科技(深圳)有限公司 Face recognition method, face recognition device, storage medium and server
CN110866466B (en) * 2019-10-30 2023-12-26 平安科技(深圳)有限公司 Face recognition method, device, storage medium and server
CN111833298A (en) * 2020-06-04 2020-10-27 石家庄喜高科技有限责任公司 Skeletal development grade detection method and terminal equipment
CN111833298B (en) * 2020-06-04 2022-08-19 石家庄喜高科技有限责任公司 Skeletal development grade detection method and terminal equipment
CN113704266A (en) * 2021-08-31 2021-11-26 深信服科技股份有限公司 Asset information processing method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN107995982A (en) 2018-05-04
CN107995982B (en) 2019-03-22

Similar Documents

Publication Publication Date Title
WO2019051814A1 (en) Target recognition method and apparatus, and intelligent terminal
US10614310B2 (en) Behavior recognition
WO2022105243A1 (en) Event detection method, apparatus, electronic device, and storage medium
KR102570278B1 (en) Apparatus and method for generating training data used to training student model from teacher model
CN105122270B (en) The method and system of people is counted using depth transducer
WO2019196205A1 (en) Foreign language teaching evaluation information generating method and apparatus
WO2019056267A1 (en) Hierarchical interactive decision making method, interactive terminal, and cloud server
JP7104779B2 (en) Target recognition method, device and intelligent terminal
CN106648078B (en) Multi-mode interaction method and system applied to intelligent robot
US20170185913A1 (en) System and method for comparing training data with test data
WO2021103868A1 (en) Method for structuring pedestrian information, device, apparatus and storage medium
CN115797736B (en) Training method, device, equipment and medium for target detection model and target detection method, device, equipment and medium
CN111652087A (en) Car checking method and device, electronic equipment and storage medium
CN109286848B (en) Terminal video information interaction method and device and storage medium
KR102104548B1 (en) The visual detecting system and visual detecting method for operating by the same
CN114093022A (en) Activity detection device, activity detection system, and activity detection method
US11194303B2 (en) Method and system for anomaly detection and notification through profiled context
WO2020007191A1 (en) Method and apparatus for living body recognition and detection, and medium and electronic device
CN102937412B (en) Foreign matter blocking detection and intelligent alarm method for camera devices
KR20190018274A (en) Method and apparatus for recognizing a subject existed in an image based on temporal movement or spatial movement of a feature point of the image
CN109598712A (en) Quality determining method, device, server and the storage medium of plastic foam cutlery box
CN110019837A (en) The generation method and device, computer equipment and readable medium of user's portrait
CN111291749A (en) Gesture recognition method and device and robot
KR20130013462A (en) Foreground extraction apparatus and method using ccb and mt lbp
Dai et al. Multi-scale video flame detection for early fire warning based on deep learning

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17925019

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17925019

Country of ref document: EP

Kind code of ref document: A1