WO2019056267A1 - Hierarchical interactive decision making method, interactive terminal, and cloud server - Google Patents

Hierarchical interactive decision making method, interactive terminal, and cloud server Download PDF

Info

Publication number
WO2019056267A1
WO2019056267A1 PCT/CN2017/102746 CN2017102746W WO2019056267A1 WO 2019056267 A1 WO2019056267 A1 WO 2019056267A1 CN 2017102746 W CN2017102746 W CN 2017102746W WO 2019056267 A1 WO2019056267 A1 WO 2019056267A1
Authority
WO
WIPO (PCT)
Prior art keywords
attribute
module
grading
feature
target object
Prior art date
Application number
PCT/CN2017/102746
Other languages
French (fr)
Chinese (zh)
Inventor
廉士国
刘兆祥
王宁
Original Assignee
达闼科技(北京)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 达闼科技(北京)有限公司 filed Critical 达闼科技(北京)有限公司
Priority to PCT/CN2017/102746 priority Critical patent/WO2019056267A1/en
Priority to CN201780001795.XA priority patent/CN107820619B/en
Publication of WO2019056267A1 publication Critical patent/WO2019056267A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/56Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
    • G06V20/58Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads
    • G06V20/584Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads of vehicle lights or traffic lights
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/174Facial expression recognition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification

Definitions

  • the present application relates to the field of robot interaction, and specifically relates to a hierarchical interaction decision method, an interactive terminal, and a cloud server.
  • the blind person is given a reminder voice, for example, when the name or gender is recognized, the voice prompt “Before your friend Xiao Ming” and “There is a woman in front”.
  • the welcome robot recognizes the guests according to the machine vision and greets them by voice. For example, “Hello! Dear VIP customer Zhang Xiaoming”, “Hello, Ms!”. This kind of interaction can bring a user-friendly experience and improve service quality.
  • the prior art interactive terminal directly interacts according to various object features recognized by the machine vision, so that interactions based on advanced machine vision may also occur in some scenes, for example, due to light, angle or occlusion. People can't ensure that male and female, unable to detect human expression and age, and other machine intelligence can't judge, if the interactive user is a female, and the interactive terminal says "Hello, sir! will be caused by gender recognition errors. .
  • the present application provides a hierarchical interaction decision method, an interactive terminal, and a cloud server, which pre-sets the attribute classification of the target object, and prioritizes the attribute rankings to form a multi-level hierarchical neural network. Attribute judgment based on the target object features recognized by machine vision and robot semantic understanding Break and output the interactive decision-making basis for the current target object information, and try to identify the more detailed attributes of the person and the object, making the interactive terminal more intelligent and flexible, and improving the user experience.
  • the present application provides the following technical solutions.
  • an embodiment of the present application provides a hierarchical interaction decision method, including the following steps:
  • the judgment result of the attribute grading of the current attribute grading and the following priority is output as the basis of the interactive decision.
  • an embodiment of the present application further provides an interaction terminal, including an information acquisition module, an identification module, an attribute module, a determination module, and an output module.
  • the information acquiring module is configured to acquire target object information, and the identifying module is configured to identify the target object feature;
  • the attribute module is configured to obtain a corresponding attribute classification according to the target object feature, and prioritize the attribute classification;
  • the determining module is configured to perform a stepwise attribute determination on the target object feature according to the prioritized order of the attribute ranking, and the target object feature is hierarchically migrated to a higher priority attribute when the current attribute classification is satisfied by the classification criterion;
  • the output module is used to output the current attribute grading and the subordinate attribute grading judgment result as the basis of the interaction decision.
  • the embodiment of the present application further provides a cloud server, including a receiving module, an attribute module, a determining module, an output module, and a sending module.
  • the receiving module is configured to receive a target object feature that is sent by the interaction terminal and is identified according to the acquired target object information;
  • the attribute module is configured to obtain a corresponding attribute classification according to the target object feature, and prioritize the attribute classification;
  • the judging module is configured to perform hierarchical attribute determination based on the priority order of the attribute ranking based on the target object feature, and the target object module is hierarchically migrated to a higher priority attribute when the current attribute classification grading standard is satisfied;
  • the output module is used for output when the target object module does not meet the grading criteria of the current attribute hierarchy.
  • the current attribute grading and the judgment result of the subordinate attribute grading are used as the basis for the interactive decision;
  • the sending module is configured to send the basis.
  • the embodiment of the present application further provides an electronic device, including:
  • At least one processor and,
  • a memory communicatively coupled to the at least one processor, a communication component, an audio data collector, and a video data collector;
  • the memory stores instructions executable by the at least one processor, the instructions being invoked by the at least one processor to invoke data of the audio data collector and the video data collector, and establishing a connection with the cloud server through the communication component to enable the At least one processor is capable of performing the method as described above.
  • the embodiment of the present application further provides a non-transitory computer readable storage medium, where the computer-readable storage medium is stored, where the computer-executable instructions are used to cause a computer to execute the above The method described.
  • the embodiment of the present application further provides a computer program product, the computer program product comprising a computer program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions, when When the program instructions are executed by the computer, the computer is caused to perform the method as described above.
  • the utility model has the beneficial effects that the hierarchical interaction decision method, the interaction terminal and the cloud server provided by the embodiments of the present application pre-set a reasonable attribute classification of the target object, prioritize the attribute classifications, and form a multi-level hierarchical neural network.
  • the target object features are judged step by step, and the optimal decision basis for the current target object information is outputted, so that the interactive terminal is more intelligent and flexible, and the user experience is improved.
  • the target object such as the attributes of the person and the object, is determined as much as possible, and more interactive decision-making basis is provided for the actual interactive application scenario, thereby improving the user's deep interactive experience.
  • FIG. 1 is a system architecture diagram of hierarchical interaction decision provided by an embodiment of the present application.
  • FIG. 2 is a block diagram of an interaction terminal provided by an embodiment of the present application.
  • FIG. 3 is a main flowchart of hierarchical interaction decision of an interactive terminal according to an embodiment of the present application.
  • FIG. 4 is an overall flowchart of hierarchical interaction decision of an interactive terminal according to an embodiment of the present application.
  • FIG. 5 is a flowchart of step-by-step determination of a face recognition embodiment of an interactive terminal according to an embodiment of the present application
  • FIG. 6 is a multi-level hierarchical neural network diagram of an interaction terminal according to an embodiment of the present application.
  • FIG. 7 is a flowchart of step-by-step determination of a vehicle identification embodiment of an interactive terminal according to an embodiment of the present application.
  • FIG. 8 is a flowchart of step-by-step judgment of an audio recognition embodiment of an interactive terminal according to an embodiment of the present application
  • FIG. 9 is a schematic diagram of a cloud server module according to an embodiment of the present application.
  • FIG. 10 is a flowchart of a method for implementing hierarchical interaction decision on a cloud server side provided by an embodiment of the present application
  • FIG. 11 is a schematic diagram showing the hardware structure of an electronic device according to the hierarchical interaction decision method provided by the embodiment of the present application.
  • the hierarchical interaction decision method, the interaction terminal and the cloud server provided by the embodiments of the present application pre-set attribute classification of the target object, prioritize the attribute classification, form a multi-level hierarchical neural network, and recognize and recognize according to machine vision and robot semantics.
  • the target object feature performs attribute judgment step by step, and outputs the interaction decision basis with the most information about the current target object, so that the interactive terminal is more intelligent and flexible, and improves the user experience.
  • the hierarchical interactive decision content of the present application includes: acquiring target object information, identifying the target object feature; acquiring corresponding attribute classification according to the target object feature, prioritizing the attribute ranking; and ranking the priority order according to the attribute classification, based on the target
  • the object feature performs step-by-step attribute judgment, and when the current attribute classification grading standard is satisfied, the attribute is graded to a higher priority attribute; when the current attribute grading grading standard is not satisfied, the current attribute grading and the following priority attribute grading result are outputted.
  • the basis for interactive decision making is performed by: acquiring target object information, identifying the target object feature; acquiring corresponding attribute classification according to the target object feature, prioritizing the attribute ranking; and ranking the priority order according to the attribute classification, based on the target
  • the object feature performs step-by-step attribute judgment, and when the current attribute classification grading standard is satisfied, the attribute is graded to a higher priority attribute; when the current attribute grading grading standard is not satisfied, the current
  • the prioritization of the attribute ranking may be starting from a primary attribute to an advanced attribute, and starting from the primary attribute to the advanced attribute based on the target object feature for progressive attribute determination. It can be understood that the stepwise attribute judgment based on the target object feature may also be judged step by step from the advanced attribute to the primary attribute, as long as the deeper result can be recognized as the result output of the system, thereby identifying as much as possible. More detailed attributes of characters and objects to support deep interaction.
  • the hierarchical interactive decision-making method, the interactive terminal and the cloud server of the present application are machine-optimized decision-making methods based on confidence priority, which realizes optimal recognition of characters and objects, and tries to identify more detailed attributes of characters and objects, thereby realizing terminal and person. Friendly interaction between.
  • each interaction device is connected to the cloud server 100.
  • the interactive terminal 100 can be smart glasses 110, can be a robot 120, can be a smart terminal 130, or can be a smart helmet 140 or the like.
  • the interactive terminal collects information about the opposite target object, such as picture image information or audio and audio information; and hierarchically and adaptively identifies the target object and the target object feature from the target object information; based on the identified target object and the target object The feature, the interactive terminal device sends corresponding interaction information to the person; the user responds to the interactive terminal and the like.
  • information about the opposite target object such as picture image information or audio and audio information
  • hierarchically and adaptively identifies the target object and the target object feature from the target object information based on the identified target object and the target object
  • the interactive terminal device sends corresponding interaction information to the person; the user responds to the interactive terminal and the like.
  • the priority order of the determination result is evaluated and the confidence analysis is performed.
  • the priority order of the attribute classification of the different attributes is: L0 (person) > L1 (sex) > L2 (person name) > L3 (expression). That is, first, it is judged whether or not the person can be identified, and the personless target object is output, and if possible, the gender is further recognized, and if the gender is confirmed, the name is recognized again.
  • the difficulty of identifying different attributes can be sorted according to the recognition algorithm of different attributes under the same conditions, such as the recognition rate of inputting the same image data. For example, it is often difficult to identify a person's name, gender recognition is difficult to face detection, or sort according to the mutual inclusion relationship between attributes. For example, to identify a gender, it is necessary to first detect the presence of a face. Among them, the discrimination of each attribute depends on the corresponding confidence. For example, when L0 recognizes a face, only the confidence exceeds W0, it is considered that the person is recognized; when L1 identifies the gender, only when the confidence exceeds W1, the gender can be recognized. When L2 recognizes a person's name, only when the confidence exceeds W2, the name recognition is successful.
  • the embodiment relates to an interactive terminal.
  • the attribute module, the judgment module, and the confidence module for implementing the hierarchical interaction decision are set in the interaction terminal.
  • the interaction terminal includes an information acquisition module 20, an identification module 22, an attribute module 30, a determination module 40, a confidence module 42, an output module 50, and an interaction module 60.
  • the information acquisition module 20 acquires target object information, and the recognition module 22 identifies the target object feature.
  • the attribute module 30 obtains a corresponding attribute ranking according to the target object feature, and the attribute ranking is prioritized from the primary attribute to the advanced attribute.
  • the judging module 40 starts from the primary attribute to the advanced attribute, and performs stepwise attribute determination based on the target object feature, and the target object feature is hierarchically migrated to the higher priority attribute when the current attribute classification is satisfied.
  • the output module 50 outputs the current attribute grading and the subordinate attribute grading judgment result as the basis of the interaction decision.
  • the confidence module is configured to perform a confidence analysis on the judgment result of each attribute classification.
  • the judging module is based on the judgment result of all subordinate attribute grading in the judgment of the superior attribute grading.
  • the target object information includes image information as well as audio information.
  • the attribute module 30 classifies the attributes of different groups.
  • the attribute module 30 is a face attribute module.
  • the information acquisition module 20 acquires image information of a face, and the recognition module 22 identifies a face feature based on the image information.
  • the face attribute module is configured to obtain a corresponding attribute ranking according to the facial feature. For example, the priority order of the attribute ranking is: person, gender, person name, and expression.
  • the determining module 40 starts from the attribute of the person to the expression attribute, and performs stepwise attribute determination based on the facial feature.
  • the confidence module 42 starts from the attribute of the person to the expression attribute, and performs level-by-level confidence determination based on the facial feature. When the face feature satisfies the grading standard of the current attribute grading, the attribute is migrated to a higher priority attribute;
  • the output module 50 When the facial feature does not satisfy the grading standard of the current attribute grading, the output module 50 outputs all the judgment results of the current attribute grading and the attribute grading of the following priority levels as the basis of the interactive decision.
  • the person and its attributes are identified from the screen according to the confidence level. Identification of attributes for people, such as people, names, genders, expressions, etc. From the perspective of confidence, the processing unit of the interactive terminal makes intelligent analysis and decision making on the screen:
  • T and D are detection thresholds, which can be used as a confidence level for judging the existence of a human face.
  • the attribute detection algorithm is used to identify attributes such as the gender of the person corresponding to the face image.
  • the gender identification algorithm as an example.
  • the name is detected by the face recognition technology to determine whether the detected face is a pre-stored or registered face, that is, the face similarity S ⁇ C, where C is a similarity threshold, which can be used as a confidence level for determining the name.
  • features for discriminating "face”, “gender”, and "person name” can be iteratively calculated from a picture, such as based on a multi-layer neural network principle.
  • the bottom layer of the neural network is calculated for discriminating No is the face feature of the face 1
  • the middle layer calculates the face feature 2 and the face feature 3 from the feature 1
  • the upper layer can be calculated based on the features of the next layer, for example, calculating the person from the face feature 2
  • the face feature 3 is used to discriminate a specific "person name”.
  • the terminal device Based on the identified person and its attributes, the terminal device sends a corresponding interaction signal to the person;
  • step 2) Based on the output of step 2), the intelligent machine responds accordingly.
  • the smart helmet If the output is ⁇ 1, Face>, the smart helmet emits a sound "someone in front";
  • the smart helmet If the output is ⁇ 1, Male>, the smart helmet emits a sound "There is a man in front”;
  • the smart helmet makes a sound "There is a lady in front”
  • the welcome robot makes a sound "Hello! What can I help you?";
  • the welcome robot makes a sound "Hello! Sir!”;
  • the welcome robot makes a sound "Hello! Ms!”;
  • the welcome robot sounds "Hello! VIP Customer NameInfo!.
  • the intelligent identification and decision-making process can be implemented in the interactive terminal or on the cloud server side.
  • the specific solution implemented in the cloud server refers to Embodiment 3.
  • the interactive terminal needs to collect the collected object information, such as an image or Audio data is transmitted to the cloud server.
  • the attribute module 30 is a vehicle attribute module.
  • the information acquisition module 20 acquires image information of the vehicle, and the recognition module 22 identifies the vehicle feature based on the image information.
  • the vehicle attribute module obtains a corresponding attribute ranking based on the vehicle characteristics. For example, the prioritization of the attribute hierarchy is: car, color, model, brand, and style.
  • the judging module 40 starts from the vehicle attribute to the style attribute, and performs stepwise attribute determination based on the vehicle characteristic
  • the confidence module 42 starts from the vehicle attribute to the style attribute, and performs stepwise confidence determination based on the vehicle characteristic, the vehicle
  • the attribute is migrated to a higher priority attribute
  • the output module 50 When the vehicle feature does not meet the grading criteria of the current attribute grading, the output module 50 outputs all the judgment results of the current attribute grading and the attribute grading of the following priority levels as the basis for the interactive decision.
  • the hierarchical decision method can be based on different target object features, and different features are extracted from the original input image or audio for different levels of decision.
  • the specific classification packet can be pre-stored in the interactive terminal.
  • the image information may be various, and may include vehicle recognition, fruit recognition, animal recognition, and the like in addition to face recognition.
  • the attribute module 30 is a sound attribute module.
  • the information acquisition module 20 acquires target object audio information, and the recognition module 22 identifies an audio feature of the target object based on the audio information.
  • the sound attribute module obtains a corresponding attribute level according to the audio feature of the target object. For example, the order of prioritization of the attribute hierarchy is: vocals, languages, keywords, and semantics.
  • the determining module 40 starts from the vocal attribute to the semantic attribute, and performs stepwise attribute determination based on the audio feature of the target object.
  • the confidence module 42 starts from the vocal attribute to the semantic attribute, and performs a step-by-step confidence judgment based on the audio feature of the target object.
  • the output module 50 outputs all the judgment results of the current attribute grading and the attribute grading of the following priority levels as the basis for the interactive decision.
  • the judgment result can be optimized based on the recognition rate of the algorithm, and the corresponding interactive output is given.
  • the order of decision making can be: vocal, language, keywords and semantics.
  • the interactive terminal can say "Hello!”; if the language is recognized, the intelligent interactive terminal can say “Hello!” in the corresponding language; if the keyword “finance” is recognized, the intelligent interactive terminal can say "There is the Bank of China's latest financial information, I don't know if you are interested.” If you identify the customer's intention "I want to understand high interest rate management,” the interactive terminal can say "high interest rate financial information is as follows".
  • the embodiment relates to a cloud server 100, wherein an attribute module, a determining module, and a confidence module for implementing hierarchical interaction decision are disposed in the cloud server 100.
  • the cloud server includes a receiving module 102 , a sending module 104 , an attribute module 130 , a determining module 140 , an output module 150 , and a confidence module 142 .
  • the receiving module 102 receives the target object feature identified by the interactive terminal according to the acquired target object information.
  • the attribute module 130 acquires a corresponding attribute ranking according to the target object feature, and the attribute ranking is prioritized from the primary attribute to the advanced attribute.
  • the determining module 140 starts from the primary attribute to the advanced attribute, and performs stepwise attribute determination based on the target object feature, and the target object feature is hierarchically migrated to a higher priority attribute when the current attribute classification is satisfied; the target object feature
  • the output module 150 outputs all the judgment results of the current attribute grading and the subordinate attribute grading as the basis of the interactive decision.
  • the sending module 104 sends the basis to the connected interactive terminal, and the interactive terminal performs a certain depth interaction with the user based on the received interaction decision.
  • the confidence module 142 performs a confidence analysis on the determination result of each attribute ranking.
  • the judging module 140 is based on the judgment result of all subordinate attribute gradings in the superior attribute grading judgment.
  • FIG. 10 a flow chart of implementing a hierarchical interaction decision method on the cloud server 100 side is shown.
  • Step 301 The receiving module receives, by the interaction terminal, a target object feature that is identified according to the acquired target object information.
  • Step 302 The attribute module acquires a corresponding attribute classification according to the target object feature, and the attribute level is prioritized from the primary attribute to the advanced attribute.
  • Step 303 The judging module starts from the primary attribute to the advanced attribute, and performs stepwise attribute determination based on the target object feature.
  • Step 304 The confidence module performs a confidence analysis on the judgment result of each attribute classification.
  • Step 305 Whether the confidence threshold grading standard is met
  • Step 306 Whether the grading standard of the current attribute grading is satisfied
  • Step 307 The target object feature is hierarchically migrated to a higher priority attribute when the grading standard of the current attribute grading is satisfied;
  • Step 308 When the target object feature does not meet the grading standard of the current attribute grading, the judgment result of the current attribute grading and the following attribute grading is output as the basis of the interactive decision;
  • Step 309 The sending module is configured to send the basis.
  • the embodiment relates to a hierarchical interaction decision method, which mainly includes the following steps:
  • Step 101 Acquire target object information, where the target object information includes image information and audio information;
  • Step 102 Identify the target object feature.
  • Step 103 Acquire a corresponding attribute ranking according to the target object feature, where the attribute ranking is prioritized from the primary attribute to the advanced attribute;
  • Step 104 Starting from the primary attribute to the advanced attribute, performing hierarchical attribute determination based on the target object feature, and the superior attribute leveling is based on the judgment result of all the lower attribute attributes in the judgment;
  • Step 105 The target object feature is hierarchically migrated to a higher priority attribute when the grading standard of the current attribute grading is satisfied;
  • Step 106 When the target object feature does not meet the grading standard of the current attribute grading, the judgment result of the attribute grading of the current attribute grading and the following priority is output as the basis of the interaction decision.
  • the hierarchical interaction decision method further includes a confidence analysis step when performing stepwise attribute determination based on the target object feature.
  • Step 201 Perform a confidence analysis on the judgment result of each attribute classification to ensure the recognition accuracy rate
  • Step 203 Whether the grading standard of the current attribute grading is satisfied, for example, whether the face is recognized, and if the face is recognized, it is further determined whether the character gender, etc.;
  • Step 205 Whether the confidence threshold grading criterion is met, for example, whether the face meets the threshold grading standard of the face image data, or whether the gender satisfies the set image feature threshold, such as the length of the hair;
  • Step 207 The target object feature is hierarchically migrated to a higher priority attribute when the grading standard of the current attribute grading is satisfied;
  • Step 209 When the target object feature does not satisfy the grading standard of the current attribute grading, the judgment result of the attribute grading of the current attribute grading and the following priority is output as the basis of the interaction decision.
  • the hierarchical interactive decision method, the interactive terminal and the cloud server of the embodiment pre-set the attribute grading of the target object, prioritize the attribute grading, and perform attribute judgment step by step according to the target object features recognized by the machine vision and the robot semantic understanding.
  • the interactive decision basis for the current target object information is outputted, so that the interactive terminal is more intelligent and flexible, and the user experience is improved.
  • the interactive content between the interactive terminal and the user is more rich and interesting, and the recognition person is taken as an example: when the camera is facing the camera under the condition of good illumination and close proximity, the name of the person can be recognized; Half face, or side-to-machine camera, can only identify the gender of the person; when facing the camera, it can only identify whether it is an individual, or take the identification of the vehicle as an example: the blind person walks with the guide helmet on the side of the road Sometimes it is possible to identify the model and color, and sometimes only recognize the color. Interactive content is more interesting and interesting.
  • FIG. 11 is a hardware structure of an electronic device 600 for hierarchical hierarchical decision making according to an embodiment of the present application.
  • the electronic device 600 includes:
  • One or more processors 610, a memory 620, an audio data collector 630, a video data collector 640, a communication component 650, and a display unit 660 are illustrated by one processor 610 in FIG.
  • the output of the audio data collector is the input of an audio recognition module, and the output of the video data collector identifies the input of the module.
  • the memory 620 stores instructions executable by the at least one processor 610, the instructions being invoked by the at least one processor to invoke data of the audio data collector and the video data collector, and the communication component 650 establishes a connection with the cloud server. To enable the at least one processor to perform the hierarchical interaction decision method.
  • the processor 610, the memory 620, the display unit 660, and the human-machine interaction unit 630 may be connected by a bus or other means, and the connection by a bus is taken as an example in FIG.
  • the memory 620 is a non-volatile computer readable storage medium, and can be used for storing a non-volatile software program, a non-volatile computer executable program, and a module, such as a program corresponding to the hierarchical interactive decision method in the embodiment of the present application.
  • An instruction/module for example, the identification module 22, the attribute module 30, the determination module 40, the confidence module 42 and the interaction module 60 shown in FIG. 2).
  • the processor 610 executes various functional applications and data processing of the server by running non-volatile software programs, instructions, and modules stored in the memory 620, that is, implementing the hierarchical interaction decision method in the above method embodiments.
  • the memory 620 may include a storage program area and an storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created according to usage of the interactive terminal, and the like.
  • memory 620 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device.
  • memory 620 can optionally include memory remotely located relative to processor 610, which can be connected to the robotic interactive electronic device via a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • the one or more modules are stored in the memory 620, and when executed by the one or more processors 610, perform a hierarchical interaction decision method in any of the above method embodiments, for example, performing the above described FIG.
  • steps 101 to 106 of the method the method steps 201 to 209 in FIG. 4 described above are executed, and the identification module 22, the attribute module 30, the determination module 40, the confidence module 42 and the interaction module 60 and the diagram in FIG. 2 are implemented. 9 functions of the attribute module 130, the determination module 140, the confidence module 142, and the transmission module 104.
  • the electronic device of the embodiment of the present application exists in various forms, including but not limited to:
  • Mobile communication equipment This type of equipment is characterized by its mobile communication function.
  • Three-dimensional display devices These devices can display and play multimedia content. Such devices include: virtual reality helmets, enhanced display helmets, or enhanced display glasses.
  • Server A device that provides computing services.
  • the server consists of a processor, a hard disk, a memory, a system bus, etc.
  • the server is similar to a general-purpose computer architecture, but because of the need to provide highly reliable services, processing power and stability High reliability in terms of reliability, security, scalability, and manageability.
  • Embodiments of the present application provide a non-transitory computer readable storage medium storing computer-executable instructions that are executed by one or more processors, for example, to perform the above
  • the method steps 101 to 106 in FIG. 3 described above, the method steps 201 to 209 in FIG. 4 described above are performed, and the identification module 22, the attribute module 30, the determination module 40, the confidence module 42 and the interaction in FIG. 2 are implemented.
  • the device embodiments described above are merely illustrative, wherein the units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, ie may be located A place, or it can be distributed to multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • the storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).

Abstract

A hierarchical interactive decision making method comprises the following steps: obtaining information about a target object (101); identifying a feature of the target object (102); obtaining a corresponding attribute grade according to the feature of the target object, and prioritizing the attribute grade (103); performing, according to a priority sequence of the attribute grade, a grade-by-grade attribute determination on the feature of the target object (104); if the feature of the target object satisfies a grading standard of the current attribute grade, migrating the feature of the target object to a higher attribute grade (105); and if the feature of the target object does not satisfy a grading standard of the current attribute grade, outputting a determination result of the current attribute grade and an attribute grade of a lower priority as a basis for making an interactive decision (106).

Description

一种分级交互决策方法、交互终端以及云端服务器Hierarchical interactive decision making method, interactive terminal and cloud server 技术领域Technical field
本申请涉及机器人交互领域,具体涉及一种分级交互决策方法、交互终端以及云端服务器。The present application relates to the field of robot interaction, and specifically relates to a hierarchical interaction decision method, an interactive terminal, and a cloud server.
背景技术Background technique
随着网络传输和大数据科技的发展以及硬件处理能力的提升,越来越多的机器人走进了人们的家庭生活。当前的人机交互方式基本都是人问机器答,尽管回答方式多种多样,且越来越智能,但大多是机器人被动接收用户的提问信息,进行筛选后直接答复用户。筛选信息之间缺乏关联。With the development of network transmission and big data technology and the improvement of hardware processing capabilities, more and more robots have entered people's family life. The current human-computer interaction methods are basically people's questions. Although the answer methods are various and more intelligent, most of them are passively receiving the user's question information, and responding directly to the user after screening. There is a lack of correlation between screening information.
随着智能设备的出现和普及,智能设备与人之间的交互变得越来越频繁,人机互动的自然体验问题需求越来越大。比如,智能导盲设备与盲人间的交互,或者迎宾机器人与客人间的交互。With the emergence and popularity of smart devices, the interaction between smart devices and people has become more and more frequent, and the natural experience of human-computer interaction has become more and more demanding. For example, the interaction between a smart guide device and a blind person, or the interaction between a welcome robot and a guest.
例如在导盲场景中,导盲设备如果检测到人的信息,经过图像分析可以确定更多人物特征。根据不同的任务特征给盲人发出提醒语音,例如识别出姓名或者性别时语音提示“前面是你的朋友小明”,“前面有位女性”。在迎宾机器人场景中,迎宾机器人根据机器视觉识别来宾,主动通过语音打招呼,例如“您好!尊敬的VIP客户张小明”,“您好,女士!”。这样的互动能带给用户友好体验,提升服务品质。For example, in a guide blind scene, if the guide device detects the person's information, image analysis can determine more character features. According to different task characteristics, the blind person is given a reminder voice, for example, when the name or gender is recognized, the voice prompt “Before your friend Xiao Ming” and “There is a woman in front”. In the welcome robot scene, the welcome robot recognizes the guests according to the machine vision and greets them by voice. For example, “Hello! Dear VIP customer Zhang Xiaoming”, “Hello, Ms!”. This kind of interaction can bring a user-friendly experience and improve service quality.
但是,现有技术的交互终端,根据机器视觉识别的多种对象特征直接进行交互,使得某些场景下基于先进的机器视觉也会出现交互尴尬,比如,因为光线、角度或者遮挡的原因无法识别人物,无法确保男性和女性、无法检测出人的表情及年龄等机器智能无法判断的时候,如果交互用户是女性,而交互终端说“您好,先生!”会因为性别识别错误而带来尴尬。However, the prior art interactive terminal directly interacts according to various object features recognized by the machine vision, so that interactions based on advanced machine vision may also occur in some scenes, for example, due to light, angle or occlusion. People can't ensure that male and female, unable to detect human expression and age, and other machine intelligence can't judge, if the interactive user is a female, and the interactive terminal says "Hello, sir!" will be caused by gender recognition errors. .
如上所述,如何利用机器智能在用户友好体验和可靠性之间达到折衷,是急需解决的问题。As mentioned above, how to use machine intelligence to achieve a compromise between user-friendly experience and reliability is an urgent problem to be solved.
因此,现有技术的机器人交互技术还有待于改进。Therefore, the prior art robot interaction technology has yet to be improved.
发明内容Summary of the invention
本申请提供一种分级交互决策方法、交互终端以及云端服务器,预先设置目标对象的属性分级,属性分级之间进行合理的优先级排序,形成多级分层神经网络。根据机器视觉和机器人语义理解识别的目标对象特征逐级进行属性判 断,并输出针对当前目标对象信息多的交互决策依据,可尽量识别出人物和物体更细节的属性,使得交互终端更加智能和灵活,提升用户体验。The present application provides a hierarchical interaction decision method, an interactive terminal, and a cloud server, which pre-sets the attribute classification of the target object, and prioritizes the attribute rankings to form a multi-level hierarchical neural network. Attribute judgment based on the target object features recognized by machine vision and robot semantic understanding Break and output the interactive decision-making basis for the current target object information, and try to identify the more detailed attributes of the person and the object, making the interactive terminal more intelligent and flexible, and improving the user experience.
为解决上述技术问题,本申请提供以下技术方案。To solve the above technical problem, the present application provides the following technical solutions.
第一方面,本申请实施例提供了一种分级交互决策方法,包括以下步骤:In a first aspect, an embodiment of the present application provides a hierarchical interaction decision method, including the following steps:
获取目标对象信息,识别该目标对象特征;Obtaining target object information and identifying the target object feature;
根据该目标对象特征获取对应的属性分级,对该属性分级进行优先排序;Obtaining a corresponding attribute ranking according to the target object feature, and prioritizing the attribute ranking;
根据所述属性分级的优先排序次序,对该目标对象特征进行逐级的属性判断,该目标对象特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;Performing a stepwise attribute determination on the target object feature according to the prioritization order of the attribute ranking, and classifying the attribute to a higher priority attribute when the target object feature satisfies the current attribute classification grading standard;
该目标对象特征不满足当前属性分级的分级标准时,输出当前属性分级及其以下优先级的属性分级的判断结果作为交互决策的依据。When the target object feature does not satisfy the grading standard of the current attribute grading, the judgment result of the attribute grading of the current attribute grading and the following priority is output as the basis of the interactive decision.
第二方面,本申请实施例还提供了一种交互终端,包括信息获取模块、识别模块、属性模块、判断模块以及输出模块,In a second aspect, an embodiment of the present application further provides an interaction terminal, including an information acquisition module, an identification module, an attribute module, a determination module, and an output module.
该信息获取模块用于获取目标对象信息,该识别模块用于识别该目标对象特征;The information acquiring module is configured to acquire target object information, and the identifying module is configured to identify the target object feature;
该属性模块用于根据该目标对象特征获取对应的属性分级,对该属性分级进行优先排序;The attribute module is configured to obtain a corresponding attribute classification according to the target object feature, and prioritize the attribute classification;
该判断模块用于根据所述属性分级的优先排序次序,对该目标对象特征进行逐级的属性判断,该目标对象特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;The determining module is configured to perform a stepwise attribute determination on the target object feature according to the prioritized order of the attribute ranking, and the target object feature is hierarchically migrated to a higher priority attribute when the current attribute classification is satisfied by the classification criterion;
该目标对象模块不满足当前属性分级的分级标准时,该输出模块用于输出当前属性分级以及下级属性分级的判断结果作为交互决策的依据。When the target object module does not meet the grading standard of the current attribute grading, the output module is used to output the current attribute grading and the subordinate attribute grading judgment result as the basis of the interaction decision.
第三方面,本申请实施例还提供了一种云端服务器,包括接收模块、属性模块、判断模块、输出模块以及发送模块,In a third aspect, the embodiment of the present application further provides a cloud server, including a receiving module, an attribute module, a determining module, an output module, and a sending module.
该接收模块用于接收交互终端发送的根据获取的目标对象信息识别的目标对象特征;The receiving module is configured to receive a target object feature that is sent by the interaction terminal and is identified according to the acquired target object information;
该属性模块用于根据该目标对象特征获取对应的属性分级,对该属性分级进行优先排序;The attribute module is configured to obtain a corresponding attribute classification according to the target object feature, and prioritize the attribute classification;
该判断模块用于根据所述属性分级的优先排序次序,基于该目标对象特征进行逐级的属性判断,该目标对象模块满足当前属性分级的分级标准时往优先级更高的属性分级迁移;The judging module is configured to perform hierarchical attribute determination based on the priority order of the attribute ranking based on the target object feature, and the target object module is hierarchically migrated to a higher priority attribute when the current attribute classification grading standard is satisfied;
该目标对象模块不满足当前属性分级的分级标准时,该输出模块用于输出 当前属性分级以及下级属性分级的判断结果作为交互决策的依据;The output module is used for output when the target object module does not meet the grading criteria of the current attribute hierarchy. The current attribute grading and the judgment result of the subordinate attribute grading are used as the basis for the interactive decision;
该发送模块用于发送该依据。The sending module is configured to send the basis.
第四方面,本申请实施例还提供了一种电子设备,包括:In a fourth aspect, the embodiment of the present application further provides an electronic device, including:
至少一个处理器;以及,At least one processor; and,
与该至少一个处理器通信连接的存储器,通信组件、音频数据采集器以及视频数据采集器;其中,a memory communicatively coupled to the at least one processor, a communication component, an audio data collector, and a video data collector; wherein
该存储器存储有可被该至少一个处理器执行的指令,该指令被该至少一个处理器执行时调用音频数据采集器与视频数据采集器的数据,通过通信组件与云端服务器建立连接,以使该至少一个处理器能够执行如上所述的方法。The memory stores instructions executable by the at least one processor, the instructions being invoked by the at least one processor to invoke data of the audio data collector and the video data collector, and establishing a connection with the cloud server through the communication component to enable the At least one processor is capable of performing the method as described above.
第五方面,本申请实施例还提供了一种非易失性计算机可读存储介质,所述计算机可读存储介质存储有计算机可执行指令,所述计算机可执行指令用于使计算机执行如上所述的方法。In a fifth aspect, the embodiment of the present application further provides a non-transitory computer readable storage medium, where the computer-readable storage medium is stored, where the computer-executable instructions are used to cause a computer to execute the above The method described.
第六方面,本申请实施例还提供了一种计算机程序产品,所述计算机程序产品包括存储在非易失性计算机可读存储介质上的计算机程序,所述计算机程序包括程序指令,当所述程序指令被计算机执行时,使所述计算机执行如上所述的方法。In a sixth aspect, the embodiment of the present application further provides a computer program product, the computer program product comprising a computer program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions, when When the program instructions are executed by the computer, the computer is caused to perform the method as described above.
本申请的有益效果在于,本申请实施例提供的分级交互决策方法、交互终端以及云端服务器,预先设置合理的目标对象的属性分级,属性分级之间进行优先级排序,形成多级分层神经网络,根据机器视觉和机器人语义理解识别的目标对象特征逐级进行属性判断,并输出针对当前目标对象信息最佳的交互决策依据,使得交互终端更加智能和灵活,提升用户体验。本实施例中,通过分级方式尽量确定目标对象,比如人物、物体的更多属性,为实际交互应用场景提供更多交互决策依据,从而提升用户深度交互体验。The utility model has the beneficial effects that the hierarchical interaction decision method, the interaction terminal and the cloud server provided by the embodiments of the present application pre-set a reasonable attribute classification of the target object, prioritize the attribute classifications, and form a multi-level hierarchical neural network. According to the machine object and the semantic definition of the robot, the target object features are judged step by step, and the optimal decision basis for the current target object information is outputted, so that the interactive terminal is more intelligent and flexible, and the user experience is improved. In this embodiment, the target object, such as the attributes of the person and the object, is determined as much as possible, and more interactive decision-making basis is provided for the actual interactive application scenario, thereby improving the user's deep interactive experience.
附图说明DRAWINGS
一个或多个实施例通过与之对应的附图中的图片进行示例性说明,这些示例性说明并不构成对实施例的限定,附图中具有相同参考数字标号的元件表示为类似的元件,除非有特别申明,附图中的图不构成比例限制。The one or more embodiments are exemplified by the accompanying drawings in the accompanying drawings, and FIG. The figures in the drawings do not constitute a scale limitation unless otherwise stated.
图1是本申请实施例提供的分级交互决策的系统架构图;1 is a system architecture diagram of hierarchical interaction decision provided by an embodiment of the present application;
图2是本申请实施例提供的交互终端的模块图;2 is a block diagram of an interaction terminal provided by an embodiment of the present application;
图3是本申请实施例提供的交互终端的分级交互决策主要流程图;3 is a main flowchart of hierarchical interaction decision of an interactive terminal according to an embodiment of the present application;
图4是本申请实施例提供的交互终端的分级交互决策整体流程图; 4 is an overall flowchart of hierarchical interaction decision of an interactive terminal according to an embodiment of the present application;
图5是本申请实施例提供的交互终端的人脸识别实施例的逐级判断流程图;FIG. 5 is a flowchart of step-by-step determination of a face recognition embodiment of an interactive terminal according to an embodiment of the present application; FIG.
图6是本申请实施例提供的交互终端的多级分层神经网络图;6 is a multi-level hierarchical neural network diagram of an interaction terminal according to an embodiment of the present application;
图7是本申请实施例提供的交互终端的车辆识别实施例的逐级判断流程图;FIG. 7 is a flowchart of step-by-step determination of a vehicle identification embodiment of an interactive terminal according to an embodiment of the present application; FIG.
图8本申请实施例提供的交互终端的音频识别实施例的逐级判断流程图;FIG. 8 is a flowchart of step-by-step judgment of an audio recognition embodiment of an interactive terminal according to an embodiment of the present application;
图9是本申请实施例提供的云端服务器模块图;FIG. 9 is a schematic diagram of a cloud server module according to an embodiment of the present application;
图10是本申请实施例提供的云端服务器一侧实现分级交互决策方法的流程图;以及10 is a flowchart of a method for implementing hierarchical interaction decision on a cloud server side provided by an embodiment of the present application;
图11是本申请实施例提供的分级交互决策方法的电子设备的硬件结构示意图。FIG. 11 is a schematic diagram showing the hardware structure of an electronic device according to the hierarchical interaction decision method provided by the embodiment of the present application.
具体实施方式Detailed ways
为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处所描述的具体实施例仅用以解释本发明,并不用于限定本发明。In order to make the objects, technical solutions, and advantages of the present application more comprehensible, the present application will be further described in detail below with reference to the accompanying drawings and embodiments. It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
本申请实施例提供的分级交互决策方法、交互终端以及云端服务器,预先设置目标对象的属性分级,属性分级之间进行优先级排序,形成多级分层神经网络,根据机器视觉和机器人语义理解识别的目标对象特征逐级进行属性判断,并输出针对当前目标对象信息最多的交互决策依据,使得交互终端更加智能和灵活,提升用户体验。The hierarchical interaction decision method, the interaction terminal and the cloud server provided by the embodiments of the present application pre-set attribute classification of the target object, prioritize the attribute classification, form a multi-level hierarchical neural network, and recognize and recognize according to machine vision and robot semantics. The target object feature performs attribute judgment step by step, and outputs the interaction decision basis with the most information about the current target object, so that the interactive terminal is more intelligent and flexible, and improves the user experience.
本申请的分级交互决策内容包括获取目标对象信息,识别该目标对象特征;根据该目标对象特征获取对应的属性分级,对该属性分级进行优先排序;根据该属性分级的优先排序次序,基于该目标对象特征进行逐级的属性判断,满足当前属性分级的分级标准时往优先级更高的属性分级迁移;不满足当前属性分级的分级标准时,输出当前属性分级及其以下优先级的属性分级的判断结果作为交互决策的依据。The hierarchical interactive decision content of the present application includes: acquiring target object information, identifying the target object feature; acquiring corresponding attribute classification according to the target object feature, prioritizing the attribute ranking; and ranking the priority order according to the attribute classification, based on the target The object feature performs step-by-step attribute judgment, and when the current attribute classification grading standard is satisfied, the attribute is graded to a higher priority attribute; when the current attribute grading grading standard is not satisfied, the current attribute grading and the following priority attribute grading result are outputted. As the basis for interactive decision making.
对该属性分级进行优先排序可以是从初级属性开始往高级属性排序,从初级属性开始往高级属性基于该目标对象特征进行逐级的属性判断。可以理解是的,基于所述目标对象特征进行逐级的属性判断也可以是从高级属性到初级属性进行逐级判断,只要能识别出更深入的结果即可作为系统的结果输出,从而尽量识别出人物和物体更细节的属性,为深度交互提供支持。The prioritization of the attribute ranking may be starting from a primary attribute to an advanced attribute, and starting from the primary attribute to the advanced attribute based on the target object feature for progressive attribute determination. It can be understood that the stepwise attribute judgment based on the target object feature may also be judged step by step from the advanced attribute to the primary attribute, as long as the deeper result can be recognized as the result output of the system, thereby identifying as much as possible. More detailed attributes of characters and objects to support deep interaction.
本申请的分级交互决策方法、交互终端以及云端服务器为基于置信优先级的机器智能优化决策方法,实现对人物和物体的优化识别,尽量识别出人物和物体更细节的属性,从而实现终端与人之间的友好互动。 The hierarchical interactive decision-making method, the interactive terminal and the cloud server of the present application are machine-optimized decision-making methods based on confidence priority, which realizes optimal recognition of characters and objects, and tries to identify more detailed attributes of characters and objects, thereby realizing terminal and person. Friendly interaction between.
请参考图1,本申请的分级交互决策系统,每一交互设备均连接至云端服务器100。该交互终端100可以是智能眼镜110,可以是机器人120,可以是智能终端130也可以是智能头盔140等。Referring to FIG. 1 , in the hierarchical interaction decision system of the present application, each interaction device is connected to the cloud server 100. The interactive terminal 100 can be smart glasses 110, can be a robot 120, can be a smart terminal 130, or can be a smart helmet 140 or the like.
交互终端在工作时,交互终端采集对面目标对象信息,比如画面图像信息或声音音频信息等;从目标对象信息中分级自适应地识别目标对象以及目标对象特征;基于识别出的目标对象以及目标对象特征,交互终端设备向人发出相应的交互信息;用户对交互终端做出响应等。When the interactive terminal is working, the interactive terminal collects information about the opposite target object, such as picture image information or audio and audio information; and hierarchically and adaptively identifies the target object and the target object feature from the target object information; based on the identified target object and the target object The feature, the interactive terminal device sends corresponding interaction information to the person; the user responds to the interactive terminal and the like.
为确保识别准确率,防止造成不必要的错误和尴尬,本申请中,在基于该目标对象特征进行逐级的属性判断时,对判断结果进行置信的优先级顺序以及进行置信度分析。例如,以识别人物为例,根据交互终端对人物的各种不同属性的识别难易程度,不同属性的属性分级的优先级顺序是:L0(人)>L1(性别)>L2(人名)>L3(表情)。亦即,首先判断确定是否能识别出人,则输出无人物目标对象,如果能则而进一步识别性别,如果能确认性别则再识别是否可识别姓名。其中,不同属性的识别难易程度,可以依据不同属性的识别算法在相同条件下,比如输入相同的图像数据的识别率来排序。例如,通常人名识别难于性别识别,性别识别难于人脸检测)或者依据属性间的相互包含关系来排序。例如,要识别性别需先检测到人脸的存在。其中,对每种属性的判别都依赖于相应的置信度,例如,L0识别人脸时只有置信度超过W0才认为识别出人物;L1识别性别时,只有置信度超过W1才认为可以识别出性别;L2识别人名时,只有置信度超过W2才认为人名识别成功。In order to ensure the recognition accuracy and prevent unnecessary errors and defects, in the present application, when the attribute determination is performed step by step based on the target object feature, the priority order of the determination result is evaluated and the confidence analysis is performed. For example, taking the identification of a character as an example, according to the difficulty level of the identification of various attributes of the character by the interactive terminal, the priority order of the attribute classification of the different attributes is: L0 (person) > L1 (sex) > L2 (person name) > L3 (expression). That is, first, it is judged whether or not the person can be identified, and the personless target object is output, and if possible, the gender is further recognized, and if the gender is confirmed, the name is recognized again. Among them, the difficulty of identifying different attributes can be sorted according to the recognition algorithm of different attributes under the same conditions, such as the recognition rate of inputting the same image data. For example, it is often difficult to identify a person's name, gender recognition is difficult to face detection, or sort according to the mutual inclusion relationship between attributes. For example, to identify a gender, it is necessary to first detect the presence of a face. Among them, the discrimination of each attribute depends on the corresponding confidence. For example, when L0 recognizes a face, only the confidence exceeds W0, it is considered that the person is recognized; when L1 identifies the gender, only when the confidence exceeds W1, the gender can be recognized. When L2 recognizes a person's name, only when the confidence exceeds W2, the name recognition is successful.
实施例1Example 1
请参考图2,本实施例涉及交互终端。其中,用于实现分级交互决策的属性模块、判断模块以及置信模块设置在该交互终端中。Referring to FIG. 2, the embodiment relates to an interactive terminal. The attribute module, the judgment module, and the confidence module for implementing the hierarchical interaction decision are set in the interaction terminal.
该交互终端包括信息获取模块20、识别模块22、属性模块30、判断模块40、置信模块42、输出模块50以及交互模块60。The interaction terminal includes an information acquisition module 20, an identification module 22, an attribute module 30, a determination module 40, a confidence module 42, an output module 50, and an interaction module 60.
该信息获取模块20获取目标对象信息,该识别模块22识别该目标对象特征。该属性模块30根据该目标对象特征获取对应的属性分级,该属性分级从初级属性到高级属性进行优先排序。The information acquisition module 20 acquires target object information, and the recognition module 22 identifies the target object feature. The attribute module 30 obtains a corresponding attribute ranking according to the target object feature, and the attribute ranking is prioritized from the primary attribute to the advanced attribute.
该判断模块40从该初级属性开始往高级属性,基于该目标对象特征进行逐级的属性判断,该目标对象特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移。该目标对象特征不满足当前属性分级的分级标准时,该输出模块50输出当前属性分级以及下级属性分级的判断结果作为交互决策的依据。The judging module 40 starts from the primary attribute to the advanced attribute, and performs stepwise attribute determination based on the target object feature, and the target object feature is hierarchically migrated to the higher priority attribute when the current attribute classification is satisfied. When the target object feature does not satisfy the grading standard of the current attribute grading, the output module 50 outputs the current attribute grading and the subordinate attribute grading judgment result as the basis of the interaction decision.
该置信模块用于对每一属性分级的判断结果进行置信度分析。 The confidence module is configured to perform a confidence analysis on the judgment result of each attribute classification.
该判断模块在上级属性分级判断时以所有下级属性分级的判断结果作为依据。The judging module is based on the judgment result of all subordinate attribute grading in the judgment of the superior attribute grading.
该目标对象信息包括图像信息以及音频信息。The target object information includes image information as well as audio information.
该属性模块30对不同组别的属性进行分类处理。The attribute module 30 classifies the attributes of different groups.
请参考图5和图6,在一实施例中,该属性模块30为人脸属性模块。该信息获取模块20获取人脸的图像信息,该识别模块22根据该图像信息识别人脸特征。该人脸属性模块用于根据该人脸特征获取对应的属性分级,比如,该属性分级的优先排序次序为:人、性别、人名以及表情。Referring to FIG. 5 and FIG. 6, in an embodiment, the attribute module 30 is a face attribute module. The information acquisition module 20 acquires image information of a face, and the recognition module 22 identifies a face feature based on the image information. The face attribute module is configured to obtain a corresponding attribute ranking according to the facial feature. For example, the priority order of the attribute ranking is: person, gender, person name, and expression.
该判断模块40从该人属性开始往表情属性,基于该人脸特征进行逐级的属性判断,该置信模块42从该人属性开始往表情属性,基于该人脸特征进行逐级置信度判断,该人脸特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;The determining module 40 starts from the attribute of the person to the expression attribute, and performs stepwise attribute determination based on the facial feature. The confidence module 42 starts from the attribute of the person to the expression attribute, and performs level-by-level confidence determination based on the facial feature. When the face feature satisfies the grading standard of the current attribute grading, the attribute is migrated to a higher priority attribute;
该人脸特征不满足当前属性分级的分级标准时,该输出模块50输出当前属性分级及其以下优先级的属性分级的所有判断结果作为交互决策的依据。When the facial feature does not satisfy the grading standard of the current attribute grading, the output module 50 outputs all the judgment results of the current attribute grading and the attribute grading of the following priority levels as the basis of the interactive decision.
本实施例中,依据置信度分级从画面中识别人及其属性。针对人的属性识别,比如,人、姓名、性别、表情等。从置信度角度,交互终端的处理单元对画面做智能分析和决策:In this embodiment, the person and its attributes are identified from the screen according to the confidence level. Identification of attributes for people, such as people, names, genders, expressions, etc. From the perspective of confidence, the processing unit of the interactive terminal makes intelligent analysis and decision making on the screen:
在检测是否有人存在时,相当于检测此人是否在关注交互设备。通过人体检测/人脸检测技术,检测是否存在人;通过人脸姿态估计技术检测人脸的姿态,比如空间朝向,以度为单位。判断人脸姿态是否较长时间t>=T,T是阈值,可通过经验设定,例如T=2秒。朝向交互终端的角度差d<=D,D是阈值,可通过经验设定,例如D=20度。其中,T和D是检测阈值,可作为判别人脸存在的置信度。When detecting whether someone is present, it is equivalent to detecting whether the person is paying attention to the interactive device. The human body detection/face detection technology is used to detect the presence of a person; the face pose estimation technique is used to detect the posture of the face, such as the spatial orientation, in degrees. It is judged whether the face posture is long time t>=T, and T is a threshold value, which can be set by experience, for example, T=2 seconds. The angle difference d<=D toward the interactive terminal, D is a threshold, which can be set empirically, for example D=20 degrees. Among them, T and D are detection thresholds, which can be used as a confidence level for judging the existence of a human face.
检测人的其他属性,例如性别。通过属性检测算法,来识别人脸图像对应的人的性别等属性。后续以性别识别算法为例来介绍。考虑到属性识别算法的精确度不足,我们依据“男性”置信度Na、“女性”置信度Nv这两个参数做如下决策:如果Na-Nv≥R,则决策结果为男性;如果Nv-Na≥R,则决策结果为女性;否则,输出<0,Face>,表示没有识别出人脸。其中,R是衡量性别差异的阈值,可作为判别性别的置信度,可依据经验选取,例如R=20(Nv+Na=100)。Detect other attributes of people, such as gender. The attribute detection algorithm is used to identify attributes such as the gender of the person corresponding to the face image. Followed by the gender identification algorithm as an example. Considering the lack of accuracy of the attribute recognition algorithm, we make the following decisions based on the two parameters of “male” confidence Na and “female” confidence Nv: if Na-Nv≥R, the decision result is male; if Nv-Na ≥R, the decision result is female; otherwise, the output <0, Face>, indicating that the face is not recognized. Among them, R is the threshold for measuring gender differences, which can be used as a confidence level for discriminating gender. It can be selected according to experience, for example, R=20 (Nv+Na=100).
检测姓名是,通过人脸识别技术来判断检测到的人脸是否是预先存储或者注册的人脸,即人脸相似度S≥C,其中C是相似度阈值,可作为判别姓名的置信度。The name is detected by the face recognition technology to determine whether the detected face is a pre-stored or registered face, that is, the face similarity S ≥ C, where C is a similarity threshold, which can be used as a confidence level for determining the name.
请参考图6,用于判别“人脸”、“性别”和“人名”的特征可以从图片中迭代式地计算,例如基于多层神经网络原理。神经网络的底层计算出用于判别是 否是人脸的人脸特征1,中间层从特征1计算出人脸特征2以及人脸特征3,上一层可以基于下一层的特征进行计算,例如,从人脸特征2计算出人脸特征3用于判别具体的“人名”。Referring to FIG. 6, features for discriminating "face", "gender", and "person name" can be iteratively calculated from a picture, such as based on a multi-layer neural network principle. The bottom layer of the neural network is calculated for discriminating No is the face feature of the face 1, the middle layer calculates the face feature 2 and the face feature 3 from the feature 1, and the upper layer can be calculated based on the features of the next layer, for example, calculating the person from the face feature 2 The face feature 3 is used to discriminate a specific "person name".
基于识别出的人及其属性,终端设备向人发出相应的交互信号;Based on the identified person and its attributes, the terminal device sends a corresponding interaction signal to the person;
基于步骤2)的输出结果,智能机器做相应的响应。Based on the output of step 2), the intelligent machine responds accordingly.
以导盲头盔为例:Take the blind guide helmet as an example:
如果输出为<1,Face>,则智能头盔发出声音“前面有人”;If the output is <1, Face>, the smart helmet emits a sound "someone in front";
如果输出为<1,Male>,则智能头盔发出声音“前面有位男士”;If the output is <1, Male>, the smart helmet emits a sound "There is a man in front";
如果输出为<1,Female>,则智能头盔发出声音“前面有位女士”;If the output is <1,Female>, the smart helmet makes a sound "There is a lady in front";
如果输出为<2,NameInfo>,则智能头盔发出声音“前面是NameInfo”;If the output is <2, NameInfo>, the smart helmet emits a sound "Before NameInfo";
以迎宾机器人为例:Take the welcome robot as an example:
如果输出为<1,Face>,则迎宾机器人发出声音“您好!有什么可以帮您?”;If the output is <1, Face>, the welcome robot makes a sound "Hello! What can I help you?";
如果输出为<1,Male>,则迎宾机器人发出声音“您好!先生!”;If the output is <1, Male>, the welcome robot makes a sound "Hello! Sir!";
如果输出为<1,Female>,则迎宾机器人发出声音“您好!女士!”;If the output is <1,Female>, the welcome robot makes a sound "Hello! Ms!";
如果输出为<2,NameInfo>,则迎宾机器人发出声音“您好!VIP客户NameInfo!”。If the output is <2, NameInfo>, the welcome robot sounds "Hello! VIP Customer NameInfo!".
智能识别和决策过程可以在交互终端实现,也可以在云端服务器一侧实现,在云端服务器实现的具体方案参考实施例3,该实施例中,交互终端需要把采集到的对象信息,比如图像或者音频数据传输到云端服务器。The intelligent identification and decision-making process can be implemented in the interactive terminal or on the cloud server side. The specific solution implemented in the cloud server refers to Embodiment 3. In this embodiment, the interactive terminal needs to collect the collected object information, such as an image or Audio data is transmitted to the cloud server.
在另一实施例中,该属性模块30为车辆属性模块。In another embodiment, the attribute module 30 is a vehicle attribute module.
该信息获取模块20获取车辆的图像信息,该识别模块22根据该图像信息识别车辆特征。该车辆属性模块根据该车辆特征获取对应的属性分级。比如,该属性分级的优先排序次序为:车、颜色、车型、品牌以及款式。The information acquisition module 20 acquires image information of the vehicle, and the recognition module 22 identifies the vehicle feature based on the image information. The vehicle attribute module obtains a corresponding attribute ranking based on the vehicle characteristics. For example, the prioritization of the attribute hierarchy is: car, color, model, brand, and style.
该判断模块40从该车属性开始往款式属性,基于该车辆特征进行逐级的属性判断,该置信模块42从该车属性开始往款式属性,基于该车辆特征进行逐级置信度判断,该车辆特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;The judging module 40 starts from the vehicle attribute to the style attribute, and performs stepwise attribute determination based on the vehicle characteristic, and the confidence module 42 starts from the vehicle attribute to the style attribute, and performs stepwise confidence determination based on the vehicle characteristic, the vehicle When the feature meets the grading criteria of the current attribute grading, the attribute is migrated to a higher priority attribute;
该车辆特征不满足当前属性分级的分级标准时,该输出模块50输出当前属性分级及其以下优先级的属性分级的所有判断结果作为交互决策的依据。When the vehicle feature does not meet the grading criteria of the current attribute grading, the output module 50 outputs all the judgment results of the current attribute grading and the attribute grading of the following priority levels as the basis for the interactive decision.
请参考图7,例如,以车辆识别为例,依次判断是否存在车辆、车辆颜色、识别车辆车型、识别车辆品牌、识别车辆款式等。判断结果可以在考虑算法识 别率的基础上做优化决策,并给出相应的交互输出。其决策顺序如图7所示。Referring to FIG. 7 , for example, taking vehicle identification as an example, it is sequentially determined whether there is a vehicle, a vehicle color, a vehicle model is recognized, a vehicle brand is recognized, a vehicle style is recognized, and the like. The result of the judgment can be considered in the algorithm Make optimization decisions based on the rate and give the corresponding interactive output. The decision sequence is shown in Figure 7.
本实施例交互终端采用分级决策方法可以依赖于不同的目标对象特征,从原始输入图像或者音频中提取出不同的特征用于不同级别的决策,具体分类分组可预先存储在交互终端。In this embodiment, the hierarchical decision method can be based on different target object features, and different features are extracted from the original input image or audio for different levels of decision. The specific classification packet can be pre-stored in the interactive terminal.
该图像信息可以是多种多样,除了人脸识别,还可以是车辆识别、水果识别、动物识别等。The image information may be various, and may include vehicle recognition, fruit recognition, animal recognition, and the like in addition to face recognition.
在又一实施例中,该属性模块30为声音属性模块。In yet another embodiment, the attribute module 30 is a sound attribute module.
该信息获取模块20获取目标对象音频信息,该识别模块22根据该音频信息识别目标对象的音频特征。该声音属性模块根据该目标对象的音频特征获取对应的属性分级。比如,该属性分级的优先排序次序为:人声、语种、关键词以及语义。The information acquisition module 20 acquires target object audio information, and the recognition module 22 identifies an audio feature of the target object based on the audio information. The sound attribute module obtains a corresponding attribute level according to the audio feature of the target object. For example, the order of prioritization of the attribute hierarchy is: vocals, languages, keywords, and semantics.
该判断模块40从该人声属性开始往语义属性,基于该目标对象的音频特征进行逐级的属性判断。该置信模块42从该人声属性开始往语义属性,基于该目标对象的音频特征进行逐级置信度判断,该音频特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移。该音频特征不满足当前属性分级的分级标准时,该输出模块50输出当前属性分级及其以下优先级的属性分级的所有判断结果作为交互决策的依据。The determining module 40 starts from the vocal attribute to the semantic attribute, and performs stepwise attribute determination based on the audio feature of the target object. The confidence module 42 starts from the vocal attribute to the semantic attribute, and performs a step-by-step confidence judgment based on the audio feature of the target object. When the audio feature satisfies the grading standard of the current attribute grading, the attribute is hierarchically migrated to the attribute with higher priority. When the audio feature does not meet the grading criteria of the current attribute grading, the output module 50 outputs all the judgment results of the current attribute grading and the attribute grading of the following priority levels as the basis for the interactive decision.
请参考图8,例如,依次判断分析是否存在人声、人声语种、提取关键词、识别语义等。判断结果可以在考虑算法识别率的基础上做优化决策,并给出相应的交互输出。其决策顺序可以采用:人声、语种、关键词以及语义。比如,识别出人声,交互终端可以说“Hello!”;如果识别出语种,智交互终端可以用相应的语种说“您好!”;如果识别出关键词“理财”,智交互终端可以说“这里有中行最新的理财信息,不知您是否感兴趣”;如果识别出客户的用意“我想了解高利率理财”,交互终端可以说“高利率理财信息如下…”。Referring to FIG. 8, for example, it is sequentially determined whether there is a human voice, a vocal language, an extracted keyword, a recognition semantic, and the like. The judgment result can be optimized based on the recognition rate of the algorithm, and the corresponding interactive output is given. The order of decision making can be: vocal, language, keywords and semantics. For example, to recognize the vocal, the interactive terminal can say "Hello!"; if the language is recognized, the intelligent interactive terminal can say "Hello!" in the corresponding language; if the keyword "finance" is recognized, the intelligent interactive terminal can say "There is the Bank of China's latest financial information, I don't know if you are interested." If you identify the customer's intention "I want to understand high interest rate management," the interactive terminal can say "high interest rate financial information is as follows...".
实施例2Example 2
如图2所示,本实施例涉及云端服务器100,其中,用于实现分级交互决策的属性模块、判断模块以及置信模块设置在该云端服务器100中。As shown in FIG. 2, the embodiment relates to a cloud server 100, wherein an attribute module, a determining module, and a confidence module for implementing hierarchical interaction decision are disposed in the cloud server 100.
请参考图9,该云端服务器包括接收模块102、发送模块104、属性模块130、判断模块140、输出模块150以及置信模块142。Referring to FIG. 9 , the cloud server includes a receiving module 102 , a sending module 104 , an attribute module 130 , a determining module 140 , an output module 150 , and a confidence module 142 .
该接收模块102接收交互终端发送的根据获取的目标对象信息识别的目标对象特征。该属性模块130根据该目标对象特征获取对应的属性分级,该属性分级从初级属性到高级属性进行优先排序。 The receiving module 102 receives the target object feature identified by the interactive terminal according to the acquired target object information. The attribute module 130 acquires a corresponding attribute ranking according to the target object feature, and the attribute ranking is prioritized from the primary attribute to the advanced attribute.
该判断模块140从该初级属性开始往高级属性,基于该目标对象特征进行逐级的属性判断,该目标对象特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;该目标对象特征不满足当前属性分级的分级标准时,该输出模块150输出当前属性分级以及下级属性分级的所有判断结果作为交互决策的依据。The determining module 140 starts from the primary attribute to the advanced attribute, and performs stepwise attribute determination based on the target object feature, and the target object feature is hierarchically migrated to a higher priority attribute when the current attribute classification is satisfied; the target object feature When the grading standard of the current attribute grading is not satisfied, the output module 150 outputs all the judgment results of the current attribute grading and the subordinate attribute grading as the basis of the interactive decision.
该发送模块104发送该依据至连接的交互终端,该交互终端基于收到的交互决策依据与用户进行一定深度的交互。The sending module 104 sends the basis to the connected interactive terminal, and the interactive terminal performs a certain depth interaction with the user based on the received interaction decision.
该置信模块142对每一属性分级的判断结果进行置信度分析。The confidence module 142 performs a confidence analysis on the determination result of each attribute ranking.
该的云端服务器,该判断模块140在上级属性分级判断时以所有下级属性分级的判断结果作为依据。In the cloud server, the judging module 140 is based on the judgment result of all subordinate attribute gradings in the superior attribute grading judgment.
请参考图10,所示为云端服务器100一侧实现分级交互决策方法的流程图。Referring to FIG. 10, a flow chart of implementing a hierarchical interaction decision method on the cloud server 100 side is shown.
步骤301:接收模块接收交互终端发送的根据获取的目标对象信息识别的目标对象特征;Step 301: The receiving module receives, by the interaction terminal, a target object feature that is identified according to the acquired target object information.
步骤302:属性模块根据该目标对象特征获取对应的属性分级,该属性分级从初级属性到高级属性进行优先排序;Step 302: The attribute module acquires a corresponding attribute classification according to the target object feature, and the attribute level is prioritized from the primary attribute to the advanced attribute.
步骤303:该判断模块从该初级属性开始往高级属性,基于该目标对象特征进行逐级的属性判断;Step 303: The judging module starts from the primary attribute to the advanced attribute, and performs stepwise attribute determination based on the target object feature.
步骤304:置信模块对每一属性分级的判断结果进行置信度分析;Step 304: The confidence module performs a confidence analysis on the judgment result of each attribute classification.
步骤305:是否满足置信度阈值分级标准;Step 305: Whether the confidence threshold grading standard is met;
步骤306:是否满足当前属性分级的分级标准;Step 306: Whether the grading standard of the current attribute grading is satisfied;
步骤307:该该目标对象特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;Step 307: The target object feature is hierarchically migrated to a higher priority attribute when the grading standard of the current attribute grading is satisfied;
步骤308:该该目标对象特征不满足当前属性分级的分级标准时,输出当前属性分级及其以下优先级的属性分级的判断结果作为交互决策的依据;Step 308: When the target object feature does not meet the grading standard of the current attribute grading, the judgment result of the current attribute grading and the following attribute grading is output as the basis of the interactive decision;
步骤309:该发送模块用于发送该依据。Step 309: The sending module is configured to send the basis.
实施例3Example 3
请参考图3,本实施例涉及分级交互决策方法,主要包括以下步骤:Referring to FIG. 3, the embodiment relates to a hierarchical interaction decision method, which mainly includes the following steps:
步骤101:获取目标对象信息,其中该目标对象信息包括图像信息以及音频信息;Step 101: Acquire target object information, where the target object information includes image information and audio information;
步骤102:识别该目标对象特征; Step 102: Identify the target object feature.
步骤103:根据该目标对象特征获取对应的属性分级,该属性分级从初级属性到高级属性进行优先排序;Step 103: Acquire a corresponding attribute ranking according to the target object feature, where the attribute ranking is prioritized from the primary attribute to the advanced attribute;
步骤104:从该初级属性开始往高级属性,基于该目标对象特征进行逐级的属性判断,上级属性分级在判断时以所有下级属性分级的判断结果作为依据;Step 104: Starting from the primary attribute to the advanced attribute, performing hierarchical attribute determination based on the target object feature, and the superior attribute leveling is based on the judgment result of all the lower attribute attributes in the judgment;
步骤105:该目标对象特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;Step 105: The target object feature is hierarchically migrated to a higher priority attribute when the grading standard of the current attribute grading is satisfied;
步骤106:该目标对象特征不满足当前属性分级的分级标准时,输出当前属性分级及其以下优先级的属性分级的判断结果作为交互决策的依据。Step 106: When the target object feature does not meet the grading standard of the current attribute grading, the judgment result of the attribute grading of the current attribute grading and the following priority is output as the basis of the interaction decision.
请参考图4,该分级交互决策方法,基于该目标对象特征进行逐级的属性判断时还包括置信度分析步骤。Referring to FIG. 4, the hierarchical interaction decision method further includes a confidence analysis step when performing stepwise attribute determination based on the target object feature.
步骤201:对每一属性分级的判断结果进行置信度分析以保证识别准确率;Step 201: Perform a confidence analysis on the judgment result of each attribute classification to ensure the recognition accuracy rate;
步骤203:是否满足当前属性分级的分级标准,比如是否识别出人脸,如果识别出人脸则进一步判断是否可以人物性别等等;Step 203: Whether the grading standard of the current attribute grading is satisfied, for example, whether the face is recognized, and if the face is recognized, it is further determined whether the character gender, etc.;
步骤205:是否满足置信度阈值分级标准,比如,识别出人脸是否满足人脸图像数据的阈值分级标准,或者识别出性别是否满足设定图像特征阈值,比如头发长度等;Step 205: Whether the confidence threshold grading criterion is met, for example, whether the face meets the threshold grading standard of the face image data, or whether the gender satisfies the set image feature threshold, such as the length of the hair;
步骤207:该目标对象特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;Step 207: The target object feature is hierarchically migrated to a higher priority attribute when the grading standard of the current attribute grading is satisfied;
步骤209:该目标对象特征不满足当前属性分级的分级标准时,输出当前属性分级及其以下优先级的属性分级的判断结果作为交互决策的依据。Step 209: When the target object feature does not satisfy the grading standard of the current attribute grading, the judgment result of the attribute grading of the current attribute grading and the following priority is output as the basis of the interaction decision.
本实施例的分级交互决策方法、交互终端以及云端服务器,预先设置目标对象的属性分级,属性分级之间进行优先级排序,根据机器视觉和机器人语义理解识别的目标对象特征逐级进行属性判断,并输出针对当前目标对象信息最多的交互决策依据,使得交互终端更加智能和灵活,提升用户体验。并且基于交互决策依据,交互终端与用户之间的交互内容更加丰富有趣味性,以识别人为例:在光照好、距离近的条件下正对机器摄像头时,可以识别出人名;用手遮住半张脸,或侧对机器摄像头时,只能识别出人的性别;背对机器摄像头时,只能识别出是否是个人,或者以识别车辆为例:盲人带着导盲头盔在路边走,有时可以识别出车型和颜色,有时只能识别出颜色。交互内容更加丰富有趣味性。The hierarchical interactive decision method, the interactive terminal and the cloud server of the embodiment pre-set the attribute grading of the target object, prioritize the attribute grading, and perform attribute judgment step by step according to the target object features recognized by the machine vision and the robot semantic understanding. The interactive decision basis for the current target object information is outputted, so that the interactive terminal is more intelligent and flexible, and the user experience is improved. And based on the basis of interactive decision-making, the interactive content between the interactive terminal and the user is more rich and interesting, and the recognition person is taken as an example: when the camera is facing the camera under the condition of good illumination and close proximity, the name of the person can be recognized; Half face, or side-to-machine camera, can only identify the gender of the person; when facing the camera, it can only identify whether it is an individual, or take the identification of the vehicle as an example: the blind person walks with the guide helmet on the side of the road Sometimes it is possible to identify the model and color, and sometimes only recognize the color. Interactive content is more interesting and interesting.
实施例4Example 4
图11是本申请实施例提供的分级交互决策方法的电子设备600的硬件结构 示意图,如图11所示,该电子设备600包括:11 is a hardware structure of an electronic device 600 for hierarchical hierarchical decision making according to an embodiment of the present application. Schematically, as shown in FIG. 11, the electronic device 600 includes:
一个或多个处理器610、存储器620、音频数据采集器630、视频数据采集器640、通信组件650以及显示单元660,图11中以一个处理器610为例。该音频数据采集器的输出为音频识别模块的输入,该视频数据采集器的输出视频识别模块的输入。该存储器620存储有可被该至少一个处理器610执行的指令,该指令被该至少一个处理器执行时调用音频数据采集器与视频数据采集器的数据,通过通信组件650与云端服务器建立连接,以使该至少一个处理器能够执行该分级交互决策方法。One or more processors 610, a memory 620, an audio data collector 630, a video data collector 640, a communication component 650, and a display unit 660 are illustrated by one processor 610 in FIG. The output of the audio data collector is the input of an audio recognition module, and the output of the video data collector identifies the input of the module. The memory 620 stores instructions executable by the at least one processor 610, the instructions being invoked by the at least one processor to invoke data of the audio data collector and the video data collector, and the communication component 650 establishes a connection with the cloud server. To enable the at least one processor to perform the hierarchical interaction decision method.
处理器610、存储器620、显示单元660以及人机交互单元630可以通过总线或者其他方式连接,图11中以通过总线连接为例。The processor 610, the memory 620, the display unit 660, and the human-machine interaction unit 630 may be connected by a bus or other means, and the connection by a bus is taken as an example in FIG.
存储器620作为一种非易失性计算机可读存储介质,可用于存储非易失性软件程序、非易失性计算机可执行程序以及模块,如本申请实施例中的分级交互决策方法对应的程序指令/模块(例如,附图2所示的识别模块22、属性模块30、判断模块40、置信模块42和交互模块60)。处理器610通过运行存储在存储器620中的非易失性软件程序、指令以及模块,从而执行服务器的各种功能应用以及数据处理,即实现上述方法实施例中的分级交互决策方法。The memory 620 is a non-volatile computer readable storage medium, and can be used for storing a non-volatile software program, a non-volatile computer executable program, and a module, such as a program corresponding to the hierarchical interactive decision method in the embodiment of the present application. An instruction/module (for example, the identification module 22, the attribute module 30, the determination module 40, the confidence module 42 and the interaction module 60 shown in FIG. 2). The processor 610 executes various functional applications and data processing of the server by running non-volatile software programs, instructions, and modules stored in the memory 620, that is, implementing the hierarchical interaction decision method in the above method embodiments.
存储器620可以包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需要的应用程序;存储数据区可存储根据交互终端的使用所创建的数据等。此外,存储器620可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他非易失性固态存储器件。在一些实施例中,存储器620可选包括相对于处理器610远程设置的存储器,这些远程存储器可以通过网络连接至机器人交互电子设备。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 620 may include a storage program area and an storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created according to usage of the interactive terminal, and the like. Moreover, memory 620 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some embodiments, memory 620 can optionally include memory remotely located relative to processor 610, which can be connected to the robotic interactive electronic device via a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
所述一个或者多个模块存储在所述存储器620中,当被所述一个或者多个处理器610执行时,执行上述任意方法实施例中的分级交互决策方法,例如,执行以上描述的图3中的方法步骤101至步骤106,执行以上描述的图4中的方法步骤201至步骤209,实现图2中的识别模块22、属性模块30、判断模块40、置信模块42和交互模块60以及图9中属性模块130、判断模块140、置信模块142和发送模块104等的功能。The one or more modules are stored in the memory 620, and when executed by the one or more processors 610, perform a hierarchical interaction decision method in any of the above method embodiments, for example, performing the above described FIG. In steps 101 to 106 of the method, the method steps 201 to 209 in FIG. 4 described above are executed, and the identification module 22, the attribute module 30, the determination module 40, the confidence module 42 and the interaction module 60 and the diagram in FIG. 2 are implemented. 9 functions of the attribute module 130, the determination module 140, the confidence module 142, and the transmission module 104.
上述产品可执行本申请实施例所提供的方法,具备执行方法相应的功能模块和有益效果。未在本实施例中详尽描述的技术细节,可参见本申请实施例所提供的方法。The above products can perform the methods provided by the embodiments of the present application, and have the corresponding functional modules and beneficial effects of the execution method. For technical details that are not described in detail in this embodiment, reference may be made to the method provided by the embodiments of the present application.
本申请实施例的电子设备以多种形式存在,包括但不限于:The electronic device of the embodiment of the present application exists in various forms, including but not limited to:
(1)移动通信设备:这类设备的特点是具备移动通信功能。 (1) Mobile communication equipment: This type of equipment is characterized by its mobile communication function.
(2)三维显示设备:这类设备可以显示和播放多媒体内容。该类设备包括:虚拟现实头盔、增强显示头盔,或者增强显示眼镜。(2) Three-dimensional display devices: These devices can display and play multimedia content. Such devices include: virtual reality helmets, enhanced display helmets, or enhanced display glasses.
(3)服务器:提供计算服务的设备,服务器的构成包括处理器、硬盘、内存、系统总线等,服务器和通用的计算机架构类似,但是由于需要提供高可靠的服务,因此在处理能力、稳定性、可靠性、安全性、可扩展性、可管理性等方面要求较高。(3) Server: A device that provides computing services. The server consists of a processor, a hard disk, a memory, a system bus, etc. The server is similar to a general-purpose computer architecture, but because of the need to provide highly reliable services, processing power and stability High reliability in terms of reliability, security, scalability, and manageability.
(4)机器人以及导盲装置等。(4) Robots and blind guides.
本申请实施例提供了一种非易失性计算机可读存储介质,所述计算机可读存储介质存储有计算机可执行指令,该计算机可执行指令被一个或多个处理器执行,例如,执行以上描述的图3中的方法步骤101至步骤106,执行以上描述的图4中的方法步骤201至步骤209,实现图2中的识别模块22、属性模块30、判断模块40、置信模块42和交互模块60以及图9中属性模块130、判断模块140、置信模块142和发送模块104等的功能。Embodiments of the present application provide a non-transitory computer readable storage medium storing computer-executable instructions that are executed by one or more processors, for example, to perform the above The method steps 101 to 106 in FIG. 3 described above, the method steps 201 to 209 in FIG. 4 described above are performed, and the identification module 22, the attribute module 30, the determination module 40, the confidence module 42 and the interaction in FIG. 2 are implemented. The functions of the module 60 and the attribute module 130, the determination module 140, the confidence module 142, and the transmission module 104 in FIG.
以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。The device embodiments described above are merely illustrative, wherein the units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, ie may be located A place, or it can be distributed to multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
通过以上的实施方式的描述,本领域普通技术人员可以清楚地了解到各实施方式可借助软件加通用硬件平台的方式来实现,当然也可以通过硬件。本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程是可以通过计算机程序来指令相关的硬件来完成,所述的程序可存储于一计算机可读取存储介质中,该程序在执行时,可包括如上述各方法的实施例的流程。其中,所述的存储介质可为磁碟、光盘、只读存储记忆体(Read-Only Memory,ROM)或随机存储记忆体(Random Access Memory,RAM)等。Through the description of the above embodiments, those skilled in the art can clearly understand that the various embodiments can be implemented by means of software plus a general hardware platform, and of course, by hardware. A person skilled in the art can understand that all or part of the process of implementing the above embodiments can be completed by a computer program to instruct related hardware, and the program can be stored in a computer readable storage medium. When executed, the flow of an embodiment of the methods as described above may be included. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).
最后应说明的是:以上实施例仅用以说明本申请的技术方案,而非对其限制;在本申请的思路下,以上实施例或者不同实施例中的技术特征之间也可以进行组合,步骤可以以任意顺序实现,并存在如上所述的本申请的不同方面的许多其它变化,为了简明,它们没有在细节中提供;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的范围。 Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present application, and are not limited thereto; in the idea of the present application, the technical features in the above embodiments or different embodiments may also be combined. The steps may be carried out in any order, and there are many other variations of the various aspects of the present application as described above, which are not provided in the details for the sake of brevity; although the present application has been described in detail with reference to the foregoing embodiments, The skilled person should understand that the technical solutions described in the foregoing embodiments may be modified, or some of the technical features may be equivalently replaced; and the modifications or substitutions do not deviate from the embodiments of the present application. The scope of the technical solution.

Claims (20)

  1. 一种分级交互决策方法,其特征在于,包括以下步骤:A hierarchical interactive decision making method, comprising the steps of:
    获取目标对象信息,识别目标对象特征;Obtain target object information and identify target object features;
    根据所述目标对象特征获取对应的属性分级,对所述属性分级进行优先排序;Obtaining a corresponding attribute ranking according to the target object feature, and prioritizing the attribute ranking;
    根据所述属性分级的优先排序次序,对所述目标对象特征进行逐级的属性判断,所述目标对象特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;And performing hierarchical attribute determination on the target object feature according to a prioritized order of the attribute ranking, where the target object feature is hierarchically migrated to a higher priority attribute when the current attribute classification is satisfied;
    所述目标对象特征不满足当前属性分级的分级标准时,输出当前属性分级及其以下优先级的属性分级的判断结果作为交互决策的依据。When the target object feature does not satisfy the grading standard of the current attribute grading, the judgment result of the attribute grading of the current attribute grading and the following priority is output as the basis of the interactive decision.
  2. 根据权利要求1所述的方法,其特征在于,基于所述目标对象特征进行逐级的属性判断之后还包括:The method according to claim 1, wherein the stepwise attribute determination based on the target object feature further comprises:
    对每一属性分级的判断结果进行置信度分析。Confidence analysis is performed on the judgment result of each attribute classification.
  3. 根据权利要求2所述的方法,其特征在于,上级属性分级在判断时以所有下级属性分级的判断结果作为依据。The method according to claim 2, wherein the superior attribute ranking is based on the judgment result of all the lower attribute classifications at the time of the judgment.
  4. 根据权利要求3所述的方法,其特征在于,所述目标对象信息包括图像信息以及音频信息。The method according to claim 3, wherein the target object information comprises image information and audio information.
  5. 根据权利要求1-4任一项所述的方法,其特征在于,A method according to any one of claims 1 to 4, characterized in that
    获取人脸的图像信息,根据所述图像信息识别人脸特征;Obtaining image information of a face, and identifying a face feature according to the image information;
    根据所述人脸特征获取对应的属性分级,所述属性分级的优先排序次序为:人、性别、人名以及表情;Obtaining a corresponding attribute ranking according to the facial feature, the prioritized order of the attribute ranking is: a person, a gender, a person name, and an expression;
    从所述人属性开始往表情属性,对所述人脸特征进行逐级的属性判断和置信度判断,所述人脸特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;Starting from the person attribute to the expression attribute, performing stepwise attribute judgment and confidence judgment on the face feature, and the face feature is hierarchically migrated to a higher priority attribute when the current attribute classification is satisfied;
    所述人脸特征不满足当前属性分级的分级标准时,输出当前属性分级及其以下优先级的属性分级的判断结果作为交互决策的依据。When the facial feature does not satisfy the grading standard of the current attribute grading, the judgment result of the attribute grading of the current attribute grading and the following priority is output as the basis of the interactive decision.
  6. 根据权利要求1-4任一项所述的方法,其特征在于,A method according to any one of claims 1 to 4, characterized in that
    获取车辆的图像信息,根据所述图像信息识别车辆特征;Acquiring image information of the vehicle, and identifying the vehicle feature according to the image information;
    根据所述车辆特征获取对应的车辆属性分级以及所述人脸属性分级对应的分级标准,所述属性分级的优先排序次序为:车、颜色、车型、品牌以及款式; And obtaining, according to the vehicle feature, a corresponding vehicle attribute classification and a classification criterion corresponding to the facial attribute classification, wherein the priority ranking order of the attribute classification is: a car, a color, a model, a brand, and a style;
    从所述车属性开始往型号属性,对所述车辆特征进行逐级的属性判断和置信度判断,所述车辆特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;Starting from the vehicle attribute to the model attribute, performing stepwise attribute determination and confidence determination on the vehicle feature, and classifying the attribute to a higher priority attribute when the vehicle feature satisfies the current attribute classification grading standard;
    所述车辆特征不满足当前属性分级的分级标准时,输出当前属性分级及其以下优先级的属性分级的判断结果作为交互决策的依据。When the vehicle feature does not satisfy the grading standard of the current attribute grading, the judgment result of the attribute grading of the current attribute grading and the following priority is output as the basis of the interactive decision.
  7. 根据权利要求1-4任一项所述的方法,其特征在于,A method according to any one of claims 1 to 4, characterized in that
    获取目标对象音频信息,根据所述音频信息识别目标对象的音频特征;Obtaining target object audio information, and identifying an audio feature of the target object according to the audio information;
    根据所述目标对象的音频特征获取对应的属性分级,所述属性分级的优先排序次序为:人声、语种、关键词以及语义;Acquiring corresponding attribute rankings according to the audio features of the target object, the priority order of the attribute rankings is: vocals, languages, keywords, and semantics;
    从所述人声属性开始往语义属性,对所述目标对象的特征进行逐级的属性判断和置信度判断,所述音频特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;Starting from the vocal attribute to the semantic attribute, performing stepwise attribute judgment and confidence judgment on the feature of the target object, and the audio feature is hierarchically migrated to a higher priority attribute when the grading standard of the current attribute grading is satisfied;
    所述音频特征不满足当前属性分级的分级标准时,输出当前属性分级及其以下优先级的属性分级的判断结果作为交互决策的依据。When the audio feature does not satisfy the grading standard of the current attribute grading, the judgment result of the attribute grading of the current attribute grading and the following priority is output as the basis of the interactive decision.
  8. 一种交互终端,其特征在于,包括信息获取模块、识别模块、属性模块、判断模块以及输出模块,An interactive terminal, comprising: an information acquisition module, an identification module, an attribute module, a judgment module, and an output module,
    所述信息获取模块用于获取目标对象信息,所述识别模块用于识别目标对象特征;The information acquiring module is configured to acquire target object information, and the identifying module is configured to identify a target object feature;
    所述属性模块用于根据所述目标对象特征获取对应的属性分级,对所述属性分级进行优先排序;The attribute module is configured to obtain a corresponding attribute classification according to the target object feature, and prioritize the attribute classification;
    所述判断模块用于根据所述属性分级的优先排序次序,对所述目标对象特征进行逐级的属性判断,所述目标对象特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;The determining module is configured to perform hierarchical attribute determination on the target object feature according to a prioritized order of the attribute ranking, and the target object feature is hierarchically migrated to a higher priority attribute when the current attribute classification is satisfied by the current attribute classification ;
    所述目标对象特征不满足当前属性分级的分级标准时,所述输出模块用于输出当前属性分级及其以下优先级的属性分级的判断结果作为交互决策的依据。When the target object feature does not meet the grading criteria of the current attribute grading, the output module is configured to output the judgment result of the attribute grading of the current attribute grading and the following priority as the basis of the interaction decision.
  9. 根据权利要求8所述的交互终端,其特征在于,所述判断模块还包括置信模块,所述置信模块用于对每一属性分级的判断结果进行置信度分析。The interactive terminal according to claim 8, wherein the determining module further comprises a confidence module, wherein the confidence module is configured to perform a confidence analysis on the determination result of each attribute ranking.
  10. 根据权利要求9所述的交互终端,其特征在于,所述判断模块在上级属性分级判断时以所有下级属性分级的判断结果作为依据。The interactive terminal according to claim 9, wherein the judging module is based on the judgment result of all subordinate attribute gradings in the superior attribute grading judgment.
  11. 根据权利要求10所述的交互终端,其特征在于,所述目标对象信息包括图像信息以及音频信息,所述交互终端为机器人或者可穿戴显示设备或者移 动终端或者导盲装置。The interactive terminal according to claim 10, wherein the target object information comprises image information and audio information, and the interactive terminal is a robot or a wearable display device or Mobile terminal or guide blind device.
  12. 根据权利要求8-11任一项所述的交互终端,其特征在于,所述属性模块为人脸属性模块;The interactive terminal according to any one of claims 8 to 11, wherein the attribute module is a face attribute module;
    所述信息获取模块用于获取人脸的图像信息,所述识别模块用于根据所述图像信息识别人脸特征;The information acquiring module is configured to acquire image information of a human face, and the identifying module is configured to identify a facial feature according to the image information;
    所述人脸属性模块用于根据所述人脸特征获取对应的属性分级,所述属性分级的优先排序次序为:人、性别、人名以及表情;The face attribute module is configured to obtain a corresponding attribute ranking according to the facial feature, and the priority order of the attribute ranking is: a person, a gender, a person name, and an expression;
    所述判断模块包括置信模块,所述判断模块用于从所述人属性开始往表情属性,基于所述人脸特征进行逐级的属性判断,所述置信模块用于从所述人属性开始往表情属性,基于所述人脸特征进行逐级置信度判断,所述人脸特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;The determining module includes a confidence module, and the determining module is configured to start from the human attribute to an expression attribute, and perform stepwise attribute determination based on the facial feature, where the confidence module is configured to start from the human attribute An expression attribute, which is determined based on the face feature, and the face feature is hierarchically migrated to a higher priority attribute when the face feature meets the current attribute classification level criterion;
    所述人脸特征不满足当前属性分级的分级标准时,所述输出模块用于输出当前属性分级及其以下优先级的属性分级的判断结果作为交互决策的依据。When the facial feature does not meet the grading standard of the current attribute grading, the output module is configured to output the current attribute grading and the following attribute grading result as the basis of the interactive decision.
  13. 根据权利要求8-11任一项所述的交互终端,其特征在于,所述属性模块为车辆属性模块;The interactive terminal according to any one of claims 8 to 11, wherein the attribute module is a vehicle attribute module;
    所述信息获取模块用于获取车辆的图像信息,所述识别模块用于根据所述图像信息识别车辆特征;The information acquiring module is configured to acquire image information of a vehicle, and the identifying module is configured to identify a vehicle feature according to the image information;
    所述车辆属性模块用于根据所述车辆特征获取对应的属性分级,所述属性分级的优先排序次序为:车、颜色、车型、品牌以及款式;The vehicle attribute module is configured to obtain a corresponding attribute ranking according to the vehicle feature, and the priority order of the attribute ranking is: a car, a color, a model, a brand, and a style;
    所述判断模块包括置信模块,所述判断模块用于从所述车属性开始往款式属性,基于所述车辆特征进行逐级的属性判断,所述置信模块用于从所述车属性开始往款式属性,基于所述车辆特征进行逐级置信度判断,所述车辆特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;The judging module includes a confidence module, and the judging module is configured to perform a stepwise attribute determination based on the vehicle characteristics from the vehicle attribute to the style attribute, where the confidence module is used to start from the vehicle attribute to the style Attribute, performing a stepwise confidence determination based on the vehicle characteristics, and the vehicle feature is hierarchically migrated to a higher priority attribute when the grading standard of the current attribute grading is satisfied;
    所述车辆特征不满足当前属性分级的分级标准时,所述输出模块用于输出当前属性分级及其以下优先级的属性分级的判断结果作为交互决策的依据。When the vehicle feature does not meet the grading standard of the current attribute grading, the output module is configured to output the judgment result of the attribute grading of the current attribute grading and the following priority as the basis of the interaction decision.
  14. 根据权利要求8-11任一项所述的交互终端,其特征在于,所述属性模块为声音属性模块;The interactive terminal according to any one of claims 8 to 11, wherein the attribute module is a sound attribute module;
    所述信息获取模块用于获取目标对象音频信息,所述识别模块用于根据所述音频信息识别目标对象的音频特征;The information acquiring module is configured to acquire target object audio information, and the identifying module is configured to identify an audio feature of the target object according to the audio information;
    所述声音属性模块用于根据所述目标对象的音频特征获取对应的属性分级,所述属性分级的优先排序次序为:人声、语种、关键词以及语义;The sound attribute module is configured to obtain a corresponding attribute ranking according to an audio feature of the target object, and the priority order of the attribute ranking is: a voice, a language, a keyword, and a semantic;
    所述判断模块包括置信模块,所述判断模块用于从所述人声属性开始往语 义属性,基于所述目标对象的音频特征进行逐级的属性判断,所述置信模块用于从所述人声属性开始往语义属性,基于所述目标对象的音频特征进行逐级置信度判断,所述音频特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;The determining module includes a confidence module, and the determining module is configured to start with the vocal attribute a semantic attribute, performing a stepwise attribute determination based on an audio feature of the target object, the confidence module is configured to start from the vocal attribute to a semantic attribute, and perform a stepwise confidence determination based on an audio feature of the target object, When the audio feature satisfies the grading standard of the current attribute grading, the grading migration is performed to the attribute with higher priority;
    所述音频特征不满足当前属性分级的分级标准时,所述输出模块用于输出当前属性分级及其以下优先级的属性分级的判断结果作为交互决策的依据。When the audio feature does not meet the grading standard of the current attribute grading, the output module is configured to output a judgment result of the attribute grading of the current attribute grading and the following priority as the basis of the interaction decision.
  15. 一种云端服务器,其特征在于,包括接收模块、属性模块、判断模块、输出模块以及发送模块,A cloud server, comprising: a receiving module, an attribute module, a determining module, an output module, and a sending module,
    所述接收模块用于接收交互终端发送的根据获取的目标对象信息识别的目标对象特征;The receiving module is configured to receive, by the interaction terminal, a target object feature that is identified according to the acquired target object information;
    所述属性模块用于根据所述目标对象特征获取对应的属性分级,对所述属性分级进行优先排序;The attribute module is configured to obtain a corresponding attribute classification according to the target object feature, and prioritize the attribute classification;
    所述判断模块用于根据所述属性分级的优先排序次序,基于所述目标对象特征进行逐级的属性判断,所述目标对象特征满足当前属性分级的分级标准时往优先级更高的属性分级迁移;The determining module is configured to perform hierarchical attribute determination based on the target object feature according to the priority order of the attribute ranking, and the target object feature is hierarchically migrated to a higher priority attribute when the current attribute classification is satisfied by the current attribute classification ;
    所述目标对象特征不满足当前属性分级的分级标准时,所述输出模块用于输出当前属性分级以及下级属性分级的判断结果作为交互决策的依据;When the target object feature does not meet the grading standard of the current attribute grading, the output module is configured to output the current attribute grading and the subordinate attribute grading judgment result as the basis of the interaction decision;
    所述发送模块用于发送所述依据。The sending module is configured to send the basis.
  16. 根据权利要求15所述的云端服务器,其特征在于,所述判断模块还包括置信模块,所述置信模块用于对每一属性分级的判断结果进行置信度分析。The cloud server according to claim 15, wherein the determining module further comprises a confidence module, wherein the confidence module is configured to perform a confidence analysis on the judgment result of each attribute ranking.
  17. 根据权利要求16所述的云端服务器,其特征在于,所述判断模块在上级属性分级判断时以所有下级属性分级的判断结果作为依据。The cloud server according to claim 16, wherein the judging module is based on the judgment result of all subordinate attribute gradings in the judging of the superior attribute grading.
  18. 一种电子设备,其中,包括:An electronic device, comprising:
    至少一个处理器;以及,At least one processor; and,
    与所述至少一个处理器通信连接的存储器,通信组件、音频数据采集器以及视频数据采集器;其中,a memory communicatively coupled to the at least one processor, a communication component, an audio data collector, and a video data collector; wherein
    所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行时调用音频数据采集器与视频数据采集器的数据,通过通信组件与云端服务器建立连接,以使所述至少一个处理器能够执行权利要求1-7任一项所述的方法。The memory stores instructions executable by the at least one processor, the instructions being invoked by the at least one processor to invoke data of an audio data collector and a video data collector, and establishing a connection with a cloud server through a communication component To enable the at least one processor to perform the method of any of claims 1-7.
  19. 一种非易失性计算机可读存储介质,其中,所述计算机可读存储介质存储有计算机可执行指令,所述计算机可执行指令用于使计算机执行权利要求1-7 任一项所述的方法。A non-transitory computer readable storage medium, wherein the computer readable storage medium stores computer executable instructions for causing a computer to perform the claims 1-7 The method of any of the preceding claims.
  20. 一种计算机程序产品,其中,所述计算机程序产品包括存储在非易失性计算机可读存储介质上的计算机程序,所述计算机程序包括程序指令,当所述程序指令被计算机执行时,使所述计算机执行权利要求1-7任一项所述的方法。 A computer program product, comprising: a computer program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions, when the program instructions are executed by a computer, The computer performs the method of any of claims 1-7.
PCT/CN2017/102746 2017-09-21 2017-09-21 Hierarchical interactive decision making method, interactive terminal, and cloud server WO2019056267A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2017/102746 WO2019056267A1 (en) 2017-09-21 2017-09-21 Hierarchical interactive decision making method, interactive terminal, and cloud server
CN201780001795.XA CN107820619B (en) 2017-09-21 2017-09-21 hierarchical interaction decision-making method, interaction terminal and cloud server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/102746 WO2019056267A1 (en) 2017-09-21 2017-09-21 Hierarchical interactive decision making method, interactive terminal, and cloud server

Publications (1)

Publication Number Publication Date
WO2019056267A1 true WO2019056267A1 (en) 2019-03-28

Family

ID=61606891

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/102746 WO2019056267A1 (en) 2017-09-21 2017-09-21 Hierarchical interactive decision making method, interactive terminal, and cloud server

Country Status (2)

Country Link
CN (1) CN107820619B (en)
WO (1) WO2019056267A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110175150A (en) * 2019-05-15 2019-08-27 重庆大学 Guest-greeting machine personal data based on data compression stores monitoring system
CN111783643A (en) * 2020-06-30 2020-10-16 北京百度网讯科技有限公司 Face recognition method and device, electronic equipment and storage medium
CN112035034A (en) * 2020-08-27 2020-12-04 芜湖盟博科技有限公司 Vehicle-mounted robot interaction method

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108762250A (en) * 2018-04-27 2018-11-06 深圳市商汤科技有限公司 The control method and device of equipment, equipment, computer program and storage medium
CN109117819B (en) * 2018-08-30 2021-03-02 Oppo广东移动通信有限公司 Target object identification method and device, storage medium and wearable device
CN110349577B (en) * 2019-06-19 2022-12-06 达闼机器人股份有限公司 Man-machine interaction method and device, storage medium and electronic equipment
CN110852785B (en) * 2019-10-12 2023-11-21 中国平安人寿保险股份有限公司 User grading method, device and computer readable storage medium
CN110837326B (en) * 2019-10-24 2021-08-10 浙江大学 Three-dimensional target selection method based on object attribute progressive expression
CN114612959A (en) * 2022-01-28 2022-06-10 北京深睿博联科技有限责任公司 Face recognition system and method for assisting blind person in interpersonal communication

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9275269B1 (en) * 2012-11-09 2016-03-01 Orbeus, Inc. System, method and apparatus for facial recognition
CN106022254A (en) * 2016-05-17 2016-10-12 上海民实文化传媒有限公司 Image recognition technology
CN106372576A (en) * 2016-08-23 2017-02-01 南京邮电大学 Deep learning-based intelligent indoor intrusion detection method and system
CN106796790A (en) * 2016-11-16 2017-05-31 深圳达闼科技控股有限公司 Method for distinguishing and correlation machine people's device are known in robot voice instruction

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7415137B2 (en) * 2002-12-13 2008-08-19 Canon Kabushiki Kaisha Image processing method, apparatus and storage medium
US8812417B2 (en) * 2012-08-20 2014-08-19 InsideSales.com, Inc. Hierarchical based sequencing machine learning model
US8352389B1 (en) * 2012-08-20 2013-01-08 Insidesales.com Multiple output relaxation machine learning model
CN104143079B (en) * 2013-05-10 2016-08-17 腾讯科技(深圳)有限公司 The method and system of face character identification
CN105563484B (en) * 2015-12-08 2018-04-10 深圳达闼科技控股有限公司 Cloud robot system, robot and robot cloud platform
CN105404877A (en) * 2015-12-08 2016-03-16 商汤集团有限公司 Human face attribute prediction method and apparatus based on deep study and multi-task study

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9275269B1 (en) * 2012-11-09 2016-03-01 Orbeus, Inc. System, method and apparatus for facial recognition
CN106022254A (en) * 2016-05-17 2016-10-12 上海民实文化传媒有限公司 Image recognition technology
CN106372576A (en) * 2016-08-23 2017-02-01 南京邮电大学 Deep learning-based intelligent indoor intrusion detection method and system
CN106796790A (en) * 2016-11-16 2017-05-31 深圳达闼科技控股有限公司 Method for distinguishing and correlation machine people's device are known in robot voice instruction

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110175150A (en) * 2019-05-15 2019-08-27 重庆大学 Guest-greeting machine personal data based on data compression stores monitoring system
CN110175150B (en) * 2019-05-15 2023-02-24 重庆大学 Welcome robot data storage monitoring system based on data compression
CN111783643A (en) * 2020-06-30 2020-10-16 北京百度网讯科技有限公司 Face recognition method and device, electronic equipment and storage medium
CN111783643B (en) * 2020-06-30 2023-09-01 北京百度网讯科技有限公司 Face recognition method and device, electronic equipment and storage medium
CN112035034A (en) * 2020-08-27 2020-12-04 芜湖盟博科技有限公司 Vehicle-mounted robot interaction method

Also Published As

Publication number Publication date
CN107820619A (en) 2018-03-20
CN107820619B (en) 2019-12-10

Similar Documents

Publication Publication Date Title
WO2019056267A1 (en) Hierarchical interactive decision making method, interactive terminal, and cloud server
CN109643158B (en) Command processing using multi-modal signal analysis
CN107995982B (en) A kind of target identification method, device and intelligent terminal
CN106294774A (en) User individual data processing method based on dialogue service and device
CN112861975B (en) Classification model generation method, classification device, electronic equipment and medium
US9932000B2 (en) Information notification apparatus and information notification method
WO2017183242A1 (en) Information processing device and information processing method
KR102129698B1 (en) Automatic fish counting system
US11036990B2 (en) Target identification method and apparatus, and intelligent terminal
CN109286848B (en) Terminal video information interaction method and device and storage medium
CN110019777A (en) A kind of method and apparatus of information classification
CN110465089A (en) Map heuristic approach, device, medium and electronic equipment based on image recognition
CN115049057A (en) Model deployment method and device, electronic equipment and storage medium
CN112528004A (en) Voice interaction method, voice interaction device, electronic equipment, medium and computer program product
CN105631404A (en) Method and device for clustering pictures
CN111432206A (en) Video definition processing method and device based on artificial intelligence and electronic equipment
WO2024001539A1 (en) Speaking state recognition method and apparatus, model training method and apparatus, vehicle, medium, computer program and computer program product
WO2023197648A1 (en) Screenshot processing method and apparatus, electronic device, and computer readable medium
KR20210048271A (en) Apparatus and method for performing automatic audio focusing to multiple objects
WO2023231211A1 (en) Voice recognition method and apparatus, electronic device, storage medium, and product
CN115062131A (en) Multi-mode-based man-machine interaction method and device
CN113093907B (en) Man-machine interaction method, system, equipment and storage medium
KR101867586B1 (en) Apparatus and method for processing image
CN113949827B (en) Video content fusion method and device
CN117576245A (en) Method and device for converting style of image, electronic equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17925944

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17925944

Country of ref document: EP

Kind code of ref document: A1