WO2022161026A1 - Action recognition method and apparatus, and electronic device and storage medium - Google Patents

Action recognition method and apparatus, and electronic device and storage medium Download PDF

Info

Publication number
WO2022161026A1
WO2022161026A1 PCT/CN2021/139746 CN2021139746W WO2022161026A1 WO 2022161026 A1 WO2022161026 A1 WO 2022161026A1 CN 2021139746 W CN2021139746 W CN 2021139746W WO 2022161026 A1 WO2022161026 A1 WO 2022161026A1
Authority
WO
WIPO (PCT)
Prior art keywords
action
electronic device
bluetooth headset
sensor data
state
Prior art date
Application number
PCT/CN2021/139746
Other languages
French (fr)
Chinese (zh)
Inventor
裴璇
郭彦东
Original Assignee
Oppo广东移动通信有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oppo广东移动通信有限公司 filed Critical Oppo广东移动通信有限公司
Publication of WO2022161026A1 publication Critical patent/WO2022161026A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/011Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/011Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
    • G06F3/012Head tracking input arrangements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1016Earpieces of the intra-aural type
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1041Mechanical or electronic switches, or control elements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1091Details not provided for in groups H04R1/1008 - H04R1/1083
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2201/00Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
    • H04R2201/10Details of earpieces, attachments therefor, earphones or monophonic headphones covered by H04R1/10 but not provided for in any of its subgroups
    • H04R2201/109Arrangements to adapt hands free headphones for use on both ears
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Definitions

  • the present application relates to the technical field of human motion recognition, and more particularly, to a motion recognition method, device, electronic device, and storage medium.
  • Human action mainly refers to the way the human body moves and the human response to the environment or objects.
  • the human body describes or expresses complex human actions through the complex movements of the limbs. It can be said that most of the actions of the human body need to be reflected through the movement of the human body. It is a very effective way to analyze the movement of the human body by studying and exploring the movement of the human body.
  • the embodiments of the present application propose a motion recognition method, apparatus, electronic device, and storage medium to improve the above problems.
  • an embodiment of the present application provides an action recognition method, the method includes: acquiring sensor data with a preset window length in a time series; inputting the sensor data into an action recognition model, and acquiring the action recognition model The output action classification result; if it is determined that an action occurs based on the action classification result, the electronic device is controlled to respond to an operation corresponding to the action.
  • an embodiment of the present application provides a motion recognition device, the device includes: a data acquisition unit for acquiring sensor data of a preset window length on a time series; a result output unit for The data is input into the action recognition model, and the action classification result output by the action recognition model is obtained; the operation execution unit is used to control the electronic device to respond to the action corresponding to the action if it is determined that an action occurs based on the action classification result. operate.
  • embodiments of the present application provide an electronic device, including one or more processors and a memory; one or more programs are stored in the memory and configured to be executed by the one or more processors Execute to implement the above method.
  • an embodiment of the present application provides a computer-readable storage medium, where a program code is stored in the computer-readable storage medium, wherein the above method is executed when the program code is executed by a processor.
  • FIG. 1 shows a schematic diagram of an application environment of an action recognition method proposed by an embodiment of the present application
  • FIG. 2 shows a flowchart of an action recognition method proposed by an embodiment of the present application
  • FIG. 3 shows a schematic structural diagram of an action recognition model proposed by an embodiment of the present application
  • FIG. 4 shows a schematic diagram of a shaking head motion according to an embodiment of the present application
  • FIG. 5 shows a schematic diagram of a shaking motion without shaking according to an embodiment of the present application
  • FIG. 6 shows a flowchart of an action recognition method proposed by another embodiment of the present application.
  • FIG. 7 shows a flowchart of an action recognition method proposed by another embodiment of the present application.
  • FIG. 8 shows a structural block diagram of a motion recognition device proposed by an embodiment of the present application.
  • FIG. 9 shows a structural block diagram of another motion recognition device proposed by an embodiment of the present application.
  • FIG. 10 shows a structural block diagram of still another action recognition device proposed by an embodiment of the present application.
  • FIG. 11 shows a structural block diagram of an electronic device for executing the motion recognition method according to an embodiment of the present application in real time of the present application
  • FIG. 12 shows a storage unit in real time of the present application for storing or carrying program codes for implementing the motion recognition method according to the embodiment of the present application.
  • Human action mainly refers to the way the human body moves and the human response to the environment or objects.
  • the human body describes or expresses complex human actions through the complex movements of the limbs. It can be said that most of the actions of the human body need to be reflected through the movement of the human body. It is a very effective way to analyze the movement of the human body by studying and exploring the movement of the human body.
  • the inventor proposes the method, device, electronic device, and storage medium for motion recognition in the embodiments of the present application.
  • the motion recognition model is obtained.
  • the output action classification result and finally, if it is determined that an action occurs based on the action classification result, the electronic device is controlled to respond to an operation corresponding to the action.
  • the sensor data of the preset window length in the acquired time series can be recognized by the motion recognition model, which can quickly and accurately identify whether there is an action, which improves the accuracy of the action recognition, and can then determine when an action occurs.
  • the control electronic device responds in real time to the operation corresponding to the action.
  • the motion recognition method provided by the implementation of the present invention can be applied to a human-computer interaction system 100
  • the human-computer interaction system 100 may include an electronic device 110 and a head-mounted device 120 .
  • the electronic device 110 may be a smart phone, a tablet computer, a smart wearable device (such as a smart bracelet, a smart watch, etc.), a smart screen, a gateway, a vehicle-mounted device, a laptop, etc.
  • the device 120 may be a headset, such as a wireless Bluetooth headset or a wired headset.
  • the electronic device 110 and the head-mounted device 120 may be connected wirelessly, or may be connected by a physical connection line.
  • the electronic device 110 and the head mounted device 120 may establish a communication link through a wireless communication protocol, where the wireless communication protocol may include a Wlan protocol, a Bluetooth protocol, or a ZigBee protocol.
  • the electronic device 110 may include a motion recognition module for performing motion recognition on the sensor data collected by the head mounted device 120, and then the electronic device 110 may be controlled through the motion recognition result.
  • the electronic device 110 may be connected to one head mounted device 120 , or may be connected to multiple head mounted devices 120 .
  • the multiple head-mounted devices may be two or more Bluetooth headsets.
  • the electronic device 110 may also establish a network connection with other electronic devices through a wired network or a wireless network. Such as via Wi-Fi connection, via mobile wireless network connection, etc.
  • an action recognition method provided by an embodiment of the present application is applied to an electronic device, and the method includes:
  • S110 Acquire sensor data of a preset window length on a time series.
  • the preset window length is a window with a certain length preset in the electronic device, the length of the window can be set to 64ms, and the step size of each sliding of the window in the time series can be set to 12ms; the sensor data is the data collected and sent by the head-mounted device in real time.
  • connection state between the electronic device and the head-mounted device may include a connected state and a non-connected state, wherein the non-connected state includes a non-connected state and a connection interruption state.
  • the connection state between the electronic device and the head-mounted device can be judged by checking the state value of the electronic device.
  • two different state values can be set for the electronic device in advance.
  • the first state value is returned, and when the electronic device is not connected to the head-mounted device, the second state value is returned, so that whether the electronic device is connected to the head-mounted device can be determined by detecting the first state value and the second state value.
  • the device is connected.
  • the first state value of the electronic device is set to 1 in advance
  • the second state value of the electronic device is set to 0. If the state value of the electronic device is detected to be 1, it is determined that the electronic device is connected to the head-mounted device.
  • the electronic device and the headset are in a disconnected state.
  • the electronic device sends a broadcast when the head-mounted device is connected and disconnected, so the electronic device can determine whether the electronic device is in a connected state with the head-mounted device by monitoring the broadcast.
  • the acquisition of the sensor data of the preset window length in the time series is to acquire the sensor data sent by the head-mounted device in the time sequence.
  • the head-mounted device collects sensor data in real time, and collects sensor data from the electronic device in real time.
  • the electronic device saves the sensor data in real time.
  • the preset window length is set.
  • the sensor data is sent to the motion recognition module in the electronic device.
  • S120 Input the sensor data into an action recognition model, and obtain an action classification result output by the action recognition model.
  • the action recognition model includes a first convolutional layer, a second convolutional layer, a maximum pooling layer, a third convolutional layer, a fourth convolutional layer, and a global average pooling layer that are connected in sequence , the fully connected layer and the softmax layer;
  • the first convolutional layer and the second convolutional layer are convolutional layers with a convolution kernel of 7 and a dimension of 64;
  • the convolutional layer is a convolutional layer with a convolution kernel of 7 and a dimension of 128.
  • the structure of the action recognition model is shown in Figure 3.
  • the functions of the first convolutional layer and the second convolutional layer are to extract the features of the sensor data with a preset window length on the acquired time series; the function of the maximum pooling layer It is to reduce the dimension of the sensor data of the preset window length on the acquired time series and the translation invariance of the sensor data of the preset window length on the acquired time series to a certain extent; the third convolution layer and the fourth convolution layer two
  • the function of each convolutional layer is to further extract the high-order features of the sensor data with a preset window length on the acquired time series and increase the dimension to maintain the richness of information after the feature scale is reduced; the global average pooling layer
  • the function is to collect the features detected at each position in the sensor data of the preset window length on the acquired time series to enhance the translation invariance; the function of the fully connected layer is to convert all the features into the logits value of each action category; Softmax The role of the layer is to convert the logits value into a probability value that sums to 1.
  • a Relu activation function is added after each convolutional layer to enhance the non-linear capability of the action recognition model.
  • a dropout layer with a probability of 0.5 is added before the fully connected layer.
  • the function of the dropout layer is to randomly set the value of half of the neurons to 0, according to the remaining half The neurons are used to predict the result to enhance the generalization ability of the action recognition model.
  • the loss function used in the action recognition model is a cross-entropy loss function.
  • the formula of the cross entropy loss function is as follows:
  • M represents the number of action categories
  • y ic represents the indicator variable (0 or 1), if the predicted action category is the same as the action category of the observed sample i, it is 0, if the predicted action category is the same as the observed sample i. The difference is 1
  • pic represents the predicted probability that the observed sample i belongs to the action category c.
  • the action recognition model is iteratively trained to obtain the gradient of the action recognition model, and the stochastic gradient descent method is used to update the parameters of the action recognition model until the maximum number of iterations is reached, and the trained action recognition model is obtained.
  • the action recognition model is a convolutional neural network model.
  • the motion recognition model is in the motion recognition module of the electronic device, when the motion recognition module receives the sensor data of the preset window length, the sensor data of the preset window length is input into the motion recognition model, and the action The recognition model infers the sensor data of the preset window length, outputs the action classification result, and returns the action classification result to the electronic device.
  • the action may be a nodding action and a shaking head action, and the action may also be other head actions, such as turning the head to the left, turning the head to the right, etc., which are not specifically limited here.
  • the operation corresponding to the action can be a preset operation that the electronic device can automatically perform, such as a page turning operation and a return operation.
  • the operation corresponding to the action can also be a user-defined operation. This is not specifically limited.
  • the electronic device when it is determined that the nodding action occurs, the electronic device is controlled to respond to the page turning operation corresponding to the nodding action; when it is determined that there is a shaking action When this occurs, the control electronics respond to a return operation corresponding to the shaking motion.
  • the electronic device when it is determined that a nodding action occurs, the electronic device can also be controlled to respond to a return operation corresponding to the nodding action; when it is determined that a shaking action occurs, the electronic device can also be controlled to respond to a page turning operation corresponding to the shaking action. .
  • the electronic device is pre-configured with a method for giving action judgment according to the action classification result, which is called a decision strategy here.
  • the electronic device combines the action classification result output by the action recognition model and the preset decision strategy for action judgment to determine whether an action has occurred. If it is determined by the decision strategy that a nodding action occurs, the electronic device is controlled to respond to the page turning operation corresponding to the nodding action. Or return operation; if it is determined through the decision strategy that a shaking motion occurs, the electronic device is controlled to respond to a returning operation or a page turning operation corresponding to the shaking motion.
  • the preset decision-making strategy can also be adjusted according to the action classification results output by the action recognition model. For example, the same action category appears several times in the n results, or the same action category appears several times in a row, then The decision-making strategy can be adjusted to determine that an action occurs when the same action category occurs several times in a row.
  • the amplitude of the nodding action is set to face the ground downwards by 10 degrees. If it is detected that the user's head is facing the ground downwards by more than 10 degrees, it is determined that the nodding action has occurred; if it is detected that the user's head is facing the ground If the magnitude of the downward movement does not exceed 10 degrees, it is determined that no nodding action occurs.
  • the preset amplitude of the shaking motion may be set to move 40 degrees to the left or right in advance, and when it is detected that the amplitude of the user's head moving to the left or right exceeds 40 degrees, it can be determined that a shaking motion occurs. , as shown in Figure 4; if it is detected that the amplitude of the user's head movement to the left or right does not exceed 40 degrees, it is determined that no shaking action occurs, as shown in Figure 5.
  • sensor signal data with a preset window length is first obtained, then the sensor signal data is input into an action recognition model, and an action classification result output by the action recognition model is obtained.
  • the control electronics respond to an operation corresponding to the action.
  • an action recognition method provided by an embodiment of the present application is applied to an electronic device, and the method includes:
  • S210 When a designated event is detected, start acquiring sensor data collected by an acceleration sensor in the head mounted device, so as to acquire sensor data with a preset window length on a time series.
  • the sensor data is sensor data collected by an acceleration sensor built in the head-mounted device when the head-mounted device is in a wearing state.
  • the head-mounted device is a wireless Bluetooth headset;
  • the acceleration sensor is a three-axis acceleration sensor.
  • the infrared sensor set in the wireless Bluetooth headset can be used to detect whether the wireless Bluetooth headset is in a wearing state. .
  • the wireless Bluetooth headset is worn on the human ear, some areas will be blocked. In this case, the area that will be blocked after the wireless Bluetooth headset is worn can be set.
  • the method of infrared sensor and then can determine whether the infrared signal emitted by the infrared sensor is blocked by detecting the state value returned by the infrared sensor, so as to determine whether the wireless Bluetooth headset is in the wearing state or not.
  • the returned state value indicates that the infrared signal is blocked, it is determined that the wireless Bluetooth headset is in a wearing state, and when the returned state value indicates that the infrared signal is not blocked, it is determined that the wireless Bluetooth headset is not wearing.
  • the wireless Bluetooth headset can at least include a locked state, an unlocked state, an off state, an on state, a sleep state, or a combination of several of these states, for example, a locked and off state, a locked and off state, and an unlocked and powered on state. Status, locked and powered off status, etc., are not limited here.
  • the wireless Bluetooth headset when the wireless Bluetooth headset is in a locked state, it means that multiple function keys or function buttons of the wireless Bluetooth headset are inoperable, so as to prevent the user from accidentally touching the wireless Bluetooth headset and to prevent the wireless Bluetooth headset from being used by others without the user's permission; when the wireless Bluetooth headset is inoperable When it is in the unlocked state, it means that multiple function keys or function buttons of the wireless Bluetooth headset can be operated, so as to facilitate the user to adjust the functions of the wireless Bluetooth headset, such as turning up the volume, lowering the volume, etc.; when the wireless Bluetooth headset is turned on , it means that the wireless bluetooth headset can currently be used; when the wireless bluetooth headset is turned off, it means that the wireless bluetooth headset is currently unavailable; when the wireless bluetooth headset is in a dormant state, it means that the wireless bluetooth headset is currently in a standby state.
  • the wireless Bluetooth headset After it is determined that the wireless Bluetooth headset is turned on and worn, it can detect whether a specified event occurs, and when a specified event is detected, the built-in acceleration sensor in the wireless Bluetooth headset starts to collect sensor data to obtain a time series. sensor data over a preset window length.
  • the specified event may be any preset event that can trigger the wireless Bluetooth headset to send sensor data to the electronic device, such as an incoming call event.
  • whether the specified event is detected may be determined by detecting whether the electronic device receives a specific identification.
  • different identifiers can be set for different events in advance. When an event occurs, the electronic device will first receive the identifier corresponding to the event, and then can determine whether the event is detected by detecting whether the identifier is received. flag, it is determined that an event has been detected. Further, the electronic device may identify the received identification, so as to determine whether the specified event is detected by identifying the identification.
  • identifiers are set for different events in advance, wherein the events may include incoming call events, power-on events, and message receiving events, etc.
  • the identifiers corresponding to the above events may be respectively set as: the identifiers corresponding to the incoming call events are set.
  • the logo corresponding to the boot event is set to KJSJ
  • the logo corresponding to the message receiving event is set to XXSJ.
  • a specified event can be set in the electronic device in advance, and when the specified event is detected, the sensor data collected by the acceleration sensor in the wireless Bluetooth headset is started to be acquired.
  • the specified event is set as an incoming call event in advance, and then when the electronic device receives the identification and determines that an event has occurred, it starts to identify the identification. If the identification is identified as LDSJ, it is determined that the specified event is detected.
  • the acceleration sensor is a three-axis acceleration sensor
  • the wireless Bluetooth headset when the wireless Bluetooth headset is turned on and in the wearing state, when a specified event is detected, the built-in three-axis acceleration sensor of the wireless Bluetooth headset starts to Sensor data is collected in three dimensions: X, Y, and Z.
  • S220 Acquire a value of the data to be input, where the value of the data to be input is an average value of the values of all channels of the sensor data of the preset window length.
  • the average value of the values of the X, Y, and Z channels of each sensor data in the sensor data of the preset window length is acquired, and the preset window length is obtained.
  • the average value of the values of the X, Y, and Z channels of each sensor data in the sensor data of the window length is used as the value of the data to be input, wherein the data to be input needs to be input into the action recognition model for action recognition. sensor data.
  • S230 Input the value of the data to be input into the action recognition model, and obtain the action classification result output by the action recognition model.
  • the value of the data to be input obtained by the above method is input into the action recognition model, and the input data to be recognized is recognized by the action recognition model, and then the action classification result output by the action recognition model can be obtained.
  • S240 Perform zero return processing on the data to be input.
  • the data to be input is preprocessed to zero.
  • the value of the data to be input is set as the average value of all channels of the sensor data of the preset window length currently obtained to obtain the new data to be input, and then the new data to be input is obtained.
  • the input data is input into the action recognition model to perform the action recognition operation, and the above operations are repeated during the action recognition process.
  • an action recognition method when a specified event is detected, sensor data collected by an acceleration sensor in a head-mounted device is started to be obtained, so as to obtain sensor data of a preset window length in a time series, and then the sensor data of a preset window length in a time series is obtained.
  • the value of the input data, the value of the data to be input is the average value of the values of all channels of the sensor with the preset window length, the value of the data to be input is input into the action recognition model, the action classification result output by the action recognition model is obtained, and then the The data to be input is reset to zero.
  • an action recognition method provided by an embodiment of the present application is applied to an electronic device, and the method includes:
  • S310 When a specified event is detected, start acquiring sensor data collected by an acceleration sensor in the head-mounted device, so as to acquire sensor data with a preset window length on a time series.
  • the specified event is an incoming call event
  • the sensor data of the preset window length on the time series may be the sensor data of the preset window length collected by the acceleration sensor in the head-mounted device in the time sequence, and then Send the sensor data of the preset window length to the electronic device, or the sensor data collected in real time for the acceleration sensor in the headset and then sent to the electronic device in real time.
  • the electronic device stores the preset window length to meet the preset window length
  • the preset The sensor data of the window length is input to the action recognition model.
  • one moment may correspond to one sensor data.
  • the sensor data corresponding to one time point can be selected every four time points from the sensor data in the time series sent by the head-mounted device. Store, or take every four sensor data points to store sensor data.
  • the sensor data sent by the head mounted device in chronological order is "data 1, data 2, data 3, data 4, data 5, data 6, data 7, data 8, data 9, data 10", and the electronic device according to The method of taking one sensor data every four sensor data points for storage, the stored sensor data includes "data 1, data 5 and data 9".
  • S320 Input the sensor data into an action recognition model, and obtain an action classification result output by the action recognition model.
  • the result queue of preset length is a preset queue of length n that stores the action classification results output by the action recognition model.
  • the inference results of the sensor data of the previous n preset window lengths are stored in the result queue of the preset length.
  • updating the result queue can be understood as updating the result queue once it is detected that a new action classification result is stored in the result queue.
  • the length of the result queue is a fixed length, after storing n action classification results, when a new action classification result is stored, it is necessary to remove the originally stored action classification result in the result queue.
  • the action classification results originally stored in the result queue can be removed based on the first-in-first-out principle. After the action classification result originally stored in the result queue is removed, the new action classification result is stored in the result queue.
  • S340 Determine whether an action occurs based on the updated result queue.
  • the electronic device After the result queue is updated by the above method, every time the result queue is updated, it is determined whether an action occurs according to the decision policy.
  • the decision-making strategy determines that an action occurs, the electronic device will perform a certain cooling process, and the electronic device will set the value of the newly generated action classification result to a preset value to prevent the electronic device from responding continuously due to the detection of the same action.
  • the step of setting the value of the newly generated action classification result as a preset value may include: if it is determined that a specified action occurs based on multiple classification results in the updated result queue, classifying the newly generated classification result as a preset value.
  • the result is set to a preset value and stored in the result queue; or, if it is judged that a specified action occurs based on multiple classification results in the updated result queue, the newly generated classification result is stored in the result queue.
  • Rear is the default value.
  • the preset value may be any set value.
  • the electronic device will wait for a period of time before inputting the sensor data of the next preset window length into the action recognition model.
  • control electronic device invokes the interface to respond to the incoming incoming call operation corresponding to the nodding action.
  • the control electronic device responds to an operation of rejecting the incoming call corresponding to the shaking motion.
  • the application scenarios may include the following scenarios: Application scenarios of device operation, application scenarios of hands being occupied by other affairs, and application scenarios of private interaction when disabled or inconvenient to perform voice operations, etc.
  • sensor data with a preset window length in a time series is acquired, the sensor data is input into an action recognition model, an action classification result output by the action recognition model is obtained, and the action classification result is stored in a preset Set the length of the result queue to update the result queue, and then judge whether there is an action based on the updated result queue.
  • the control electronic device responds to the nodding action corresponding to the incoming incoming call operation, When it is determined that a shaking motion occurs, the control electronic device puts in an operation of rejecting the incoming call in response to the shaking motion.
  • the user can answer the call by nodding, and reject the call by shaking his head, which solves the inconvenient hand touch screen operation in some scenarios.
  • the user which provides users with a brand-new basic interaction method, and also provides users with one more interactive choice, which at the same time helps to improve the expressiveness of each electronic device and enhance the user experience.
  • a motion recognition apparatus 400 provided by the present application operates on an electronic device, and the apparatus 400 includes a data acquisition unit 410 , a result output unit 420 and an operation execution unit 430 .
  • the data acquisition unit 410 is configured to acquire sensor data with a preset window length on a time series.
  • the data acquisition unit 410 is configured to start acquiring sensor data collected by an acceleration sensor in the headset when a specified event is detected, so as to acquire sensor data with a preset window length on a time series.
  • the data acquisition unit 410 is further configured to acquire the connection state between the electronic device and the head-mounted device; if the electronic device and the head-mounted device are in the connection state, acquire a preset window in the time series length of sensor data.
  • the data acquisition unit 410 is further configured to acquire the state value of the electronic device; if the state value of the electronic device is the first state value, it is determined that the electronic device is connected to the head-mounted device. state; if the state value of the electronic device is the second state value, it is determined that the electronic device and the head mounted device are in a disconnected state.
  • the data acquisition unit 410 is further configured to determine the connection state between the electronic device and the head-mounted device by monitoring broadcasts.
  • the data acquisition unit 410 is configured to detect whether the wireless Bluetooth headset is in a wearing state; if the wireless Bluetooth headset is in a wearing state, acquire sensor data of a preset window length on the time series.
  • the data acquisition unit 410 is further configured to acquire the status value returned by the infrared sensor set in the wireless Bluetooth headset; if the status value indicates that the infrared signal emitted by the infrared sensor is blocked, it is determined that the wireless The Bluetooth headset is in a wearing state; if the state value indicates that the infrared signal emitted by the infrared sensor is not blocked, it is determined that the wireless Bluetooth headset is in an unworn state.
  • the data acquisition unit 410 is further configured to detect whether the wireless Bluetooth headset is in a powered-on state if the wireless Bluetooth headset is in a wearing state; when it is determined that the wireless Bluetooth headset is in a wearing state and is in a powered-on state, Acquire sensor data for a preset window length over a time series.
  • the data acquisition unit 410 is further configured to detect whether multiple function keys or function buttons of the wireless Bluetooth headset are in a normal working state; if it is detected that multiple function keys or function buttons of the wireless Bluetooth headset are in In a normal working state, it is determined that the wireless Bluetooth headset is in a powered-on state.
  • the data acquisition unit 410 is further configured to detect whether the wireless Bluetooth headset can currently be used; if it is detected that the wireless Bluetooth headset can currently be used, it is determined that the wireless Bluetooth headset is in a powered-on state.
  • the result output unit 420 is configured to input the sensor data into the action recognition model, and obtain the action classification result output by the action recognition model.
  • the action recognition model includes a first convolutional layer, a second convolutional layer, a maximum pooling layer, a third convolutional layer, a fourth convolutional layer, a global average pooling layer, a fully connected layer and softmax layer;
  • the first convolutional layer and the second convolutional layer are convolutional layers with a convolution kernel of 7 and a dimension of 64;
  • the third convolutional layer and the fourth convolutional layer are volumes A convolutional layer with a kernel of 7 and a dimension of 128.
  • the convolutional neural network model is iteratively trained until the number of iterations reaches the maximum number of iterations, and the convolutional neural network model when the maximum number of iterations is reached is used as the action recognition model.
  • the cross entropy loss function is Wherein: the M represents the number of action categories; the y ic represents the indicator variable; the pic represents the predicted probability that the observed sample i belongs to the action category c.
  • the result output unit 420 is configured to obtain the value of the data to be input, where the value of the data to be input is the average value of the values of all channels of the sensor data of the preset window length;
  • the value of the action recognition model is input to the action recognition model, and the action classification result output by the action recognition model is obtained; the data to be input is zeroed.
  • the operation execution unit 430 is configured to control the electronic device to respond to an operation corresponding to the action if it is determined that an action occurs based on the action classification result.
  • the operation execution unit 430 is configured to control the electronic device to respond to an incoming call operation corresponding to the nodding action when it is determined that a nodding action occurs; when it is determined that a shaking action occurs, control the electronic device The device responds to an operation of rejecting the incoming call corresponding to the shaking motion.
  • the apparatus 400 further includes:
  • the action judging unit 440 is configured to save the action classification result in a result queue of preset length, so as to update the result queue; and determine whether an action occurs based on the updated result queue.
  • the apparatus 400 further includes:
  • the result saving unit 450 is configured to set the newly generated classification result as a preset value and store it in the result queue if it is judged that a specified action occurs based on a plurality of classification results in the updated result queue; or, If it is determined that a specified action occurs based on the plurality of classification results in the updated result queue, the newly generated classification result is stored in the result queue and set as a preset value.
  • an embodiment of the present application further provides another electronic device 100 that can execute the foregoing action recognition method.
  • the electronic device 100 includes one or more (only one shown in the figure) a processor 102, a memory 104, and a network module 106 that are coupled to each other.
  • the memory 104 stores a program that can execute the content in the foregoing embodiments
  • the processor 102 can execute the program stored in the memory 104 .
  • the processor 102 may include one or more processing cores.
  • the processor 102 uses various interfaces and lines to connect various parts of the entire electronic device 100, and executes by running or executing the instructions, programs, code sets or instruction sets stored in the memory 104, and calling the data stored in the memory 104.
  • the processor 102 may adopt at least one of digital signal processing (Digital Signal Processing, DSP), field-programmable gate array (Field-Programmable Gate Array, FPGA), and programmable logic array (Programmable Logic Array, PLA). implemented in a hardware form.
  • DSP Digital Signal Processing
  • FPGA Field-Programmable Gate Array
  • PLA programmable logic array
  • the processor 102 may integrate one or a combination of a central processing unit (Central Processing Unit, CPU), a graphics processing unit (Graphics Processing Unit, GPU), a modem, and the like.
  • CPU Central Processing Unit
  • GPU Graphics Processing Unit
  • the CPU mainly handles the operating system, user interface and application programs, etc.
  • the GPU is used for rendering and drawing of the display content
  • the modem is used to handle wireless communication. It can be understood that, the above-mentioned modem may not be integrated into the processor 102, and is implemented by a communication chip alone.
  • the memory 104 may include a random access memory (Random Access Memory, RAM), or may include a read-only memory (Read-Only Memory, ROM). Memory 104 may be used to store instructions, programs, codes, sets of codes, or sets of instructions.
  • the memory 104 may include a stored program area and a stored data area, wherein the stored program area may store instructions for implementing the operating system, instructions for implementing at least one function (such as a touch function, a sound playback function, an image playback function, etc.) , instructions for implementing the following method embodiments, and the like.
  • the memory 104 may store motion recognition means.
  • the device for setting motion recognition may be the aforementioned device 400 .
  • the storage data area can also store data (such as phone book, audio and video data, chat record data) created by the electronic device 100 during use.
  • the network module 106 is used for receiving and sending electromagnetic waves, realizing mutual conversion between electromagnetic waves and electrical signals, so as to communicate with a communication network or other devices, for example, communicate with an audio playback device.
  • the network module 106 may include various existing circuit elements for performing these functions, eg, antennas, radio frequency transceivers, digital signal processors, encryption/decryption chips, subscriber identity module (SIM) cards, memory, etc. .
  • the network module 106 can communicate with various networks such as the Internet, an intranet, a wireless network, or communicate with other devices through a wireless network.
  • the aforementioned wireless network may include a cellular telephone network, a wireless local area network, or a metropolitan area network.
  • the network module 106 may interact with the base station for information.
  • FIG. 12 shows a structural block diagram of a computer-readable storage medium provided by an embodiment of the present application.
  • the computer-readable storage medium 800 stores program codes, and the program codes can be invoked by the processor to execute the methods described in the above method embodiments.
  • the computer readable storage medium 800 may be an electronic memory such as flash memory, EEPROM (Electrically Erasable Programmable Read Only Memory), EPROM, hard disk, or ROM.
  • the computer-readable storage medium 800 includes a non-transitory computer-readable storage medium.
  • Computer readable storage medium 800 has storage space for program code 810 to perform any of the method steps in the above-described methods. These program codes can be read from or written to one or more computer program products.
  • Program code 810 may be compressed, for example, in a suitable form.
  • sensor signal data with a preset window length is first obtained, and then the sensor signal data is input into an action recognition model to obtain an action classification result output by the action recognition model, Finally, if it is determined that an action occurs based on the action classification result, the electronic device is controlled to respond to an operation corresponding to the action.
  • the sensor data of the preset window length in the acquired time series can be recognized by the motion recognition model, so that whether there is an action can be quickly and accurately identified, the accuracy of the action recognition can be improved, and then it can be determined When an action occurs, the control electronic device responds to the operation corresponding to the action in real time.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

Disclosed are an action recognition method and apparatus, and an electronic device and a storage medium. The method comprises: acquiring sensor data of a pre-set window length on a time sequence; inputting the sensor data into an action recognition model, and acquiring an action classification result output by the action recognition model; and if it is determined, on the basis of the action classification result, that an action occurs, controlling an electronic device to respond to an operation corresponding to the action. By means of the method, acquired sensor data of a pre-set window length on a time sequence is recognized by means of an action recognition model, and whether an action occurs can be quickly and accurately recognized, such that the accuracy of action recognition is improved, and thus, when it is determined that an action occurs, an electronic device can be controlled to respond, in real time, to an operation corresponding to the action.

Description

动作识别方法、装置、电子设备及存储介质Motion recognition method, device, electronic device and storage medium
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请要求于2021年1月28日提交的申请号为202110118213.4的中国申请的优先权,其在此出于所有目的通过引用将其全部内容并入本文。This application claims priority to Chinese Application No. 202110118213.4 filed on January 28, 2021, which is hereby incorporated by reference in its entirety for all purposes.
技术领域technical field
本申请涉及人体动作识别技术领域,更具体地,涉及一种动作识别方法、装置、电子设备以及存储介质。The present application relates to the technical field of human motion recognition, and more particularly, to a motion recognition method, device, electronic device, and storage medium.
背景技术Background technique
人体动作主要是指人体行动的方式,以及人对环境或物体的反应,人体通过肢体的复杂运动,来描述或表达复杂的人体动作。可以说,人体的动作多数是需要通过人体肢体的运动来体现出来。通过对人体的运动来研究探索人体的动作就成为分析人体动作的一个非常有效的途径。Human action mainly refers to the way the human body moves and the human response to the environment or objects. The human body describes or expresses complex human actions through the complex movements of the limbs. It can be said that most of the actions of the human body need to be reflected through the movement of the human body. It is a very effective way to analyze the movement of the human body by studying and exploring the movement of the human body.
发明内容SUMMARY OF THE INVENTION
鉴于上述问题,本申请实施例提出了一种动作识别方法、装置、电子设备以及存储介质,以改善上述问题。In view of the above problems, the embodiments of the present application propose a motion recognition method, apparatus, electronic device, and storage medium to improve the above problems.
第一方面,本申请实施例提供了一种动作识别方法,所述方法包括:获取时间序列上的预设窗口长度的传感器数据;将所述传感器数据输入动作识别模型,获取所述动作识别模型输出的动作分类结果;若基于所述动作分类结果确定有动作发生时,控制所述电子设备响应于与所述动作对应的操作。In a first aspect, an embodiment of the present application provides an action recognition method, the method includes: acquiring sensor data with a preset window length in a time series; inputting the sensor data into an action recognition model, and acquiring the action recognition model The output action classification result; if it is determined that an action occurs based on the action classification result, the electronic device is controlled to respond to an operation corresponding to the action.
第二方面,本申请实施例提供了一种动作识别装置,所述装置包括:数据获取单元,用于获取时间序列上的预设窗口长度的传感器数据;结果输出单元,用于将所述传感器数据输入动作识别模型,获取所述动作识别模型输出的动作分类结果;操作执行单元,用于若基于所述动作分类结果确定有动作发生时,控制所述电子设备响应于与所述动作对应的操作。In a second aspect, an embodiment of the present application provides a motion recognition device, the device includes: a data acquisition unit for acquiring sensor data of a preset window length on a time series; a result output unit for The data is input into the action recognition model, and the action classification result output by the action recognition model is obtained; the operation execution unit is used to control the electronic device to respond to the action corresponding to the action if it is determined that an action occurs based on the action classification result. operate.
第三方面,本申请实施例提供了一种电子设备,包括一个或多个处理器以及存储 器;一个或多个程序被存储在所述存储器中并被配置为由所述一个或多个处理器执行以实现上述的方法。In a third aspect, embodiments of the present application provide an electronic device, including one or more processors and a memory; one or more programs are stored in the memory and configured to be executed by the one or more processors Execute to implement the above method.
第四方面,本申请实施例提供了一种计算机可读存储介质,所述计算机可读存储介质中存储有程序代码,其中,在所述程序代码被处理器运行时执行上述的方法。In a fourth aspect, an embodiment of the present application provides a computer-readable storage medium, where a program code is stored in the computer-readable storage medium, wherein the above method is executed when the program code is executed by a processor.
附图说明Description of drawings
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions in the embodiments of the present application more clearly, the following briefly introduces the drawings that are used in the description of the embodiments. Obviously, the drawings in the following description are only some embodiments of the present application. For those skilled in the art, other drawings can also be obtained from these drawings without creative effort.
图1示出了本申请一实施例提出的一种动作识别方法的应用环境示意图;FIG. 1 shows a schematic diagram of an application environment of an action recognition method proposed by an embodiment of the present application;
图2示出了本申请一实施例提出的一种动作识别方法的流程图;FIG. 2 shows a flowchart of an action recognition method proposed by an embodiment of the present application;
图3示出了本申请一实施例提出的一种动作识别模型的结构示意图;FIG. 3 shows a schematic structural diagram of an action recognition model proposed by an embodiment of the present application;
图4示出了本申请一实施例提出的一种发生摇头动作的示意图;FIG. 4 shows a schematic diagram of a shaking head motion according to an embodiment of the present application;
图5示出了本申请一实施例提出的一种没有发生摇头动作的示意图;FIG. 5 shows a schematic diagram of a shaking motion without shaking according to an embodiment of the present application;
图6示出了本申请另一实施例提出的一种动作识别方法的流程图;FIG. 6 shows a flowchart of an action recognition method proposed by another embodiment of the present application;
图7示出了本申请又一实施例提出的一种动作识别方法的流程图;FIG. 7 shows a flowchart of an action recognition method proposed by another embodiment of the present application;
图8示出了本申请一实施例提出的一种动作识别装置的结构框图;FIG. 8 shows a structural block diagram of a motion recognition device proposed by an embodiment of the present application;
图9示出了本申请一实施例提出的又一种动作识别装置的结构框图;FIG. 9 shows a structural block diagram of another motion recognition device proposed by an embodiment of the present application;
图10示出了本申请一实施例提出的再一种动作识别装置的结构框图;FIG. 10 shows a structural block diagram of still another action recognition device proposed by an embodiment of the present application;
图11示出了本申请实时中的用于执行根据本申请实施例的动作识别方法的电子设备的结构框图;FIG. 11 shows a structural block diagram of an electronic device for executing the motion recognition method according to an embodiment of the present application in real time of the present application;
图12示出了本申请实时中的用于保存或者携带实现根据本申请实施例的动作识别方法的程序代码的存储单元。FIG. 12 shows a storage unit in real time of the present application for storing or carrying program codes for implementing the motion recognition method according to the embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are only a part of the embodiments of the present application, but not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative work fall within the protection scope of the present application.
人体动作主要是指人体行动的方式,以及人对环境或物体的反应,人体通过肢体的复杂运动,来描述或表达复杂的人体动作。可以说,人体的动作多数是需要通过人体肢体的运动来体现出来。通过对人体的运动来研究探索人体的动作就成为分析人体动作的一个非常有效的途径。Human action mainly refers to the way the human body moves and the human response to the environment or objects. The human body describes or expresses complex human actions through the complex movements of the limbs. It can be said that most of the actions of the human body need to be reflected through the movement of the human body. It is a very effective way to analyze the movement of the human body by studying and exploring the movement of the human body.
发明人在对相关的动作识别方法的研究中发现,相关的动作识别方法对人体动作进行识别的准确性还有待提高。The inventor found in the research on the related action recognition methods that the accuracy of the related action recognition methods for recognizing human actions still needs to be improved.
因此,发明人提出了本申请实施例中的动作识别的方法、装置、电子设备以及存储介质,通过获取预设窗口长度的传感器信号数据,然后将传感器信号数据输入动作识别模型,获取动作识别模型输出的动作分类结果,最后若基于动作分类结果确定有动作发生时,控制电子设备响应于与动作对应的操作。通过动作识别模型对获取到的时间序列上的预设窗口长度的传感器数据进行识别,可以快速准确的识别出是否有动作发生,提高了动作识别的准确性,进而可以在确定了有动作发生时,控制电子设备实时响应于与该动作对应的操作。Therefore, the inventor proposes the method, device, electronic device, and storage medium for motion recognition in the embodiments of the present application. By acquiring sensor signal data with a preset window length, and then inputting the sensor signal data into the motion recognition model, the motion recognition model is obtained. The output action classification result, and finally, if it is determined that an action occurs based on the action classification result, the electronic device is controlled to respond to an operation corresponding to the action. The sensor data of the preset window length in the acquired time series can be recognized by the motion recognition model, which can quickly and accurately identify whether there is an action, which improves the accuracy of the action recognition, and can then determine when an action occurs. , the control electronic device responds in real time to the operation corresponding to the action.
下面针对本发明实施提供的动作识别方法的应用环境进行介绍:The following is an introduction to the application environment of the action recognition method provided by the implementation of the present invention:
请参阅图1,本发明实施提供的动作识别方法可以应用于人机交互系统100,该人机交互系统100可以包括电子设备110以及头戴设备120。如图1所示,所述电子设备110可以为智能手机、平板电脑、智能穿戴设备(如智能手环、智能手表等)、智慧大屏、网关、车载设备以及笔记本电脑等,所述头戴设备120可以为耳机,如无线蓝牙耳机或者有线耳机等。其中,电子设备110和头戴设备120之间可以通过无线连接,也可以通过实体连接线进行连接。电子设备110与头戴设备120可以通过无线通信协议建立通信链路,其中,无线通信协议可以包括Wlan协议、蓝牙协议或者ZigBee协议等。Referring to FIG. 1 , the motion recognition method provided by the implementation of the present invention can be applied to a human-computer interaction system 100 , and the human-computer interaction system 100 may include an electronic device 110 and a head-mounted device 120 . As shown in FIG. 1 , the electronic device 110 may be a smart phone, a tablet computer, a smart wearable device (such as a smart bracelet, a smart watch, etc.), a smart screen, a gateway, a vehicle-mounted device, a laptop, etc. The device 120 may be a headset, such as a wireless Bluetooth headset or a wired headset. The electronic device 110 and the head-mounted device 120 may be connected wirelessly, or may be connected by a physical connection line. The electronic device 110 and the head mounted device 120 may establish a communication link through a wireless communication protocol, where the wireless communication protocol may include a Wlan protocol, a Bluetooth protocol, or a ZigBee protocol.
所述电子设备110中可以包括动作识别模块,用于对头戴设备120采集的传感器数据进行动作识别,进而可以通过动作识别结果对电子设备110进行控制。所述电子设备110可以和一个头戴设备120连接,也可以和多个头戴设备120连接。其中,多个头戴设备,可以是两个或者两个以上的蓝牙耳机。The electronic device 110 may include a motion recognition module for performing motion recognition on the sensor data collected by the head mounted device 120, and then the electronic device 110 may be controlled through the motion recognition result. The electronic device 110 may be connected to one head mounted device 120 , or may be connected to multiple head mounted devices 120 . The multiple head-mounted devices may be two or more Bluetooth headsets.
另外,电子设备110也可以通过有线网络或者无线网络与其他电子设备建立网络连接。如通过Wi-Fi连接、通过移动无线网络连接等。In addition, the electronic device 110 may also establish a network connection with other electronic devices through a wired network or a wireless network. Such as via Wi-Fi connection, via mobile wireless network connection, etc.
下面将结合附图具体描述本申请的各实施例。The embodiments of the present application will be described in detail below with reference to the accompanying drawings.
请参阅图2,本申请实施例提供的一种动作识别方法,应用于电子设备,所述方法包括:Referring to FIG. 2, an action recognition method provided by an embodiment of the present application is applied to an electronic device, and the method includes:
S110:获取时间序列上的预设窗口长度的传感器数据。S110: Acquire sensor data of a preset window length on a time series.
作为一种方式,所述预设窗口长度为电子设备中预先设置的具有一定长度的窗口,所述窗口的长度可以设置为64ms,所述窗口在时间序列上每次滑动的步长可以设置为12ms;所述传感器数据为头戴设备实时采集发送的数据。In one way, the preset window length is a window with a certain length preset in the electronic device, the length of the window can be set to 64ms, and the step size of each sliding of the window in the time series can be set to 12ms; the sensor data is the data collected and sent by the head-mounted device in real time.
在本申请实施例中,在获取时间序列上的预设窗口长度的传感器数据之前还可以检测电子设备是否与头戴设备处于连接状态,若电子设备与头戴设备处于连接状态,电子设备再获取时间序列上的预设窗口长度的传感器数据。In the embodiment of the present application, before acquiring the sensor data of the preset window length in the time series, it is also possible to detect whether the electronic device is in a connected state with the head-mounted device. Sensor data for preset window lengths over time series.
具体的,电子设备与头戴设备之间的连接状态可以包括处于连接状态和没有处于连接状态,其中,没有处于连接状态包括非连接状态和连接中断状态。Specifically, the connection state between the electronic device and the head-mounted device may include a connected state and a non-connected state, wherein the non-connected state includes a non-connected state and a connection interruption state.
作为其中一种方式,可以通过查看电子设备的状态值的方式来判断电子设备与头戴设备之间的连接状态,具体的,可以预先为电子设备设置两个不同的状态值,当电 子设备连接头戴设备时,返回第一状态值,当电子设备没有连接头戴设备时,返回第二状态值,从而可以通过检测第一状态值和第二状态值的方式来确定电子设备是否与头戴设备处于连接状态。示例性的,预先将电子设备的第一状态值设置为1,将电子设备的第二状态值设置为0,若检测到电子设备的状态值为1,则确定电子设备与头戴设备处于连接状态,若检测到电子设备返回的状态值为0,则确定电子设备与头戴设备处于非连接状态。可选的,若在时间顺序上检测到相邻时刻电子设备返回的状态值由1变为0,则确定电子设备与头戴设备处于连接中断状态。As one of the methods, the connection state between the electronic device and the head-mounted device can be judged by checking the state value of the electronic device. Specifically, two different state values can be set for the electronic device in advance. When the electronic device is connected When the head-mounted device is used, the first state value is returned, and when the electronic device is not connected to the head-mounted device, the second state value is returned, so that whether the electronic device is connected to the head-mounted device can be determined by detecting the first state value and the second state value. The device is connected. Exemplarily, the first state value of the electronic device is set to 1 in advance, and the second state value of the electronic device is set to 0. If the state value of the electronic device is detected to be 1, it is determined that the electronic device is connected to the head-mounted device. If it is detected that the status value returned by the electronic device is 0, it is determined that the electronic device and the headset are in a disconnected state. Optionally, if it is detected in time sequence that the state value returned by the electronic device changes from 1 to 0 at adjacent moments, it is determined that the connection between the electronic device and the head-mounted device is in a disconnected state.
作为其中另一种方式,电子设备在连接头戴设备和断开头戴设备的时候都会发送广播,所以,电子设备可以通过监听广播的方式,确定电子设备是否与头戴设备处于连接状态。As another method, the electronic device sends a broadcast when the head-mounted device is connected and disconnected, so the electronic device can determine whether the electronic device is in a connected state with the head-mounted device by monitoring the broadcast.
在本申请实施例中,所述获取时间序列上的预设窗口长度的传感器数据为按照时间顺序获取头戴设备发送的传感器数据。头戴设备实时采集传感器数据,并实时向电子设备采集的传感器数据,电子设备实时保存传感器数据,当电子设备获取的传感器数据的数据长度满足预先设置的预设窗口长度时,将预设窗口长度的传感器数据发送给电子设备中的动作识别模块。In the embodiment of the present application, the acquisition of the sensor data of the preset window length in the time series is to acquire the sensor data sent by the head-mounted device in the time sequence. The head-mounted device collects sensor data in real time, and collects sensor data from the electronic device in real time. The electronic device saves the sensor data in real time. When the data length of the sensor data obtained by the electronic device meets the preset preset window length, the preset window length is set. The sensor data is sent to the motion recognition module in the electronic device.
S120:将所述传感器数据输入动作识别模型,获取所述动作识别模型输出的动作分类结果。S120: Input the sensor data into an action recognition model, and obtain an action classification result output by the action recognition model.
在本申请实施例中,所述动作识别模型包括依次连接的第一卷积层,第二卷积层,最大池化层,第三卷积层,第四卷积层,全局平均池化层,全连层以及softmax层;所述第一卷积层和所述第二卷积层为卷积核为7且维度为64的卷积层;所述第三卷积层和所述第四卷积层为卷积核为7且维度为128的卷积层。动作识别模型的结构如图3所示。In the embodiment of the present application, the action recognition model includes a first convolutional layer, a second convolutional layer, a maximum pooling layer, a third convolutional layer, a fourth convolutional layer, and a global average pooling layer that are connected in sequence , the fully connected layer and the softmax layer; the first convolutional layer and the second convolutional layer are convolutional layers with a convolution kernel of 7 and a dimension of 64; the third convolutional layer and the fourth convolutional layer The convolutional layer is a convolutional layer with a convolution kernel of 7 and a dimension of 128. The structure of the action recognition model is shown in Figure 3.
在图3中,第一卷积层和第二卷积层两个卷积层的作用是用于提取获取的时间序列上的的预设窗口长度的传感器数据的特征;最大池化层的作用是降低获取的时间序列上的预设窗口长度的传感器数据的维度以及获取时间序列上的预设窗口长度的传感器数据在一定程度的平移不变性;第三卷积层和第四卷积层两个卷积层的作用是用于进一步提取获取的时间序列上的预设窗口长度的传感器数据的高阶特征并且在特征尺度缩小后增大维度来保持信息的丰富程度;全局平均池化层的作用是将获取的时间序列上的预设窗口长度的传感器数据中各个位置检测到的特征集中起来,增强平移不变性;全连接层的作用是将所有特征转化为各个动作类别的logits数值;Softmax层的作用是将logits数值转化成总和为1的概率值。In Figure 3, the functions of the first convolutional layer and the second convolutional layer are to extract the features of the sensor data with a preset window length on the acquired time series; the function of the maximum pooling layer It is to reduce the dimension of the sensor data of the preset window length on the acquired time series and the translation invariance of the sensor data of the preset window length on the acquired time series to a certain extent; the third convolution layer and the fourth convolution layer two The function of each convolutional layer is to further extract the high-order features of the sensor data with a preset window length on the acquired time series and increase the dimension to maintain the richness of information after the feature scale is reduced; the global average pooling layer The function is to collect the features detected at each position in the sensor data of the preset window length on the acquired time series to enhance the translation invariance; the function of the fully connected layer is to convert all the features into the logits value of each action category; Softmax The role of the layer is to convert the logits value into a probability value that sums to 1.
另外在图3中没有显示的是,在每一个卷积层之后都加入了一个Relu激活函数来增强动作识别模型的非线性能力。在动作识别模型的训练中,全连接层之前加入一个概率为0.5的Dropout层,在该动作识别模型中,Dropout层的作用是随机将一半的神经元的值设置为0,根据剩下的一半的神经元来预测结果,来增强动作识别模型的泛化 能力。Also not shown in Figure 3 is that a Relu activation function is added after each convolutional layer to enhance the non-linear capability of the action recognition model. In the training of the action recognition model, a dropout layer with a probability of 0.5 is added before the fully connected layer. In this action recognition model, the function of the dropout layer is to randomly set the value of half of the neurons to 0, according to the remaining half The neurons are used to predict the result to enhance the generalization ability of the action recognition model.
在本申请实施例中,动作识别模型中采用的损失函数是交叉熵损失函数。其中,交叉熵损失函数公式如下:In this embodiment of the present application, the loss function used in the action recognition model is a cross-entropy loss function. Among them, the formula of the cross entropy loss function is as follows:
Figure PCTCN2021139746-appb-000001
Figure PCTCN2021139746-appb-000001
其中:M表征动作类别的数量;y ic表征指示变量(0或者1),如果预测出的动作类别和观测样本i的动作类别相同就是0,如果预测出的动作类别和观测样本i的动作类别不同就是1;p ic表征对于观测样本i属于动作类别c的预测概率。 Among them: M represents the number of action categories; y ic represents the indicator variable (0 or 1), if the predicted action category is the same as the action category of the observed sample i, it is 0, if the predicted action category is the same as the observed sample i. The difference is 1; pic represents the predicted probability that the observed sample i belongs to the action category c.
基于上述交叉熵损失函数对动作识别模型进行迭代训练,得到动作识别模型的梯度,采用随机梯度下降法更新动作识别模型的参数,直到达到最大迭代次数,得到训练后的动作识别模型。其中,动作识别模型为卷积神经网络模型。Based on the above cross-entropy loss function, the action recognition model is iteratively trained to obtain the gradient of the action recognition model, and the stochastic gradient descent method is used to update the parameters of the action recognition model until the maximum number of iterations is reached, and the trained action recognition model is obtained. Among them, the action recognition model is a convolutional neural network model.
可选的,所述动作识别模型在电子设备的动作识别模块中,当动作识别模块接收到预设窗口长度的传感器数据时,将该预设窗口长度的传感器数据输入到动作识别模型中,动作识别模型对该的预设窗口长度的传感器数据进行推理,输出动作分类结果,并将动作分类结果返回给电子设备。Optionally, the motion recognition model is in the motion recognition module of the electronic device, when the motion recognition module receives the sensor data of the preset window length, the sensor data of the preset window length is input into the motion recognition model, and the action The recognition model infers the sensor data of the preset window length, outputs the action classification result, and returns the action classification result to the electronic device.
S130:若基于所述动作分类结果确定有动作发生时,控制所述电子设备响应于与所述动作对应的操作。S130: If it is determined that an action occurs based on the action classification result, control the electronic device to respond to an operation corresponding to the action.
在本申请实施例中,所述动作可以为点头动作和摇头动作,所述动作也可以为其他头部动作,比如头部向左旋转,头部向右旋转等动作,在此不做具体限定;所述与动作对应的操作可以为预先设置的电子设备可以自动执行的操作,比如翻页操作和返回操作,可选的,所述与动作对应的操作也可以为用户自定义的操作,在此不做具体限定。In this embodiment of the present application, the action may be a nodding action and a shaking head action, and the action may also be other head actions, such as turning the head to the left, turning the head to the right, etc., which are not specifically limited here. ; The operation corresponding to the action can be a preset operation that the electronic device can automatically perform, such as a page turning operation and a return operation. Optionally, the operation corresponding to the action can also be a user-defined operation. This is not specifically limited.
作为一种方式,若与点头动作和摇头动作对应的操作为翻页操作和返回操作,当确定有点头动作发生时,控制电子设备响应于与点头动作对应的翻页操作;当确定有摇头动作发生时,控制电子设备响应于与摇头动作对应的返回操作。可选的,当确定有点头动作发生时,也可以控制电子设备响应于与点头动作对应的返回操作;当确定有摇头动作发生时,也可以控制电子设备响应于与摇头动作对应的翻页操作。As one way, if the operations corresponding to the nodding action and the shaking action are page turning and returning operations, when it is determined that the nodding action occurs, the electronic device is controlled to respond to the page turning operation corresponding to the nodding action; when it is determined that there is a shaking action When this occurs, the control electronics respond to a return operation corresponding to the shaking motion. Optionally, when it is determined that a nodding action occurs, the electronic device can also be controlled to respond to a return operation corresponding to the nodding action; when it is determined that a shaking action occurs, the electronic device can also be controlled to respond to a page turning operation corresponding to the shaking action. .
具体的,电子设备中预先设置有根据动作分类结果给出动作判定的方法,在此叫做决策策略。电子设备结合动作识别模型输出的动作分类结果以及预先设置的动作判断的决策策略判断是否有动作发生,若通过决策策略判断有点头动作发生,则控制电子设备响应于与点头动作对应的翻页操作或返回操作;若通过决策策略判断有摇头动作发生,则控制电子设备响应于与摇头动作对应的返回操作或翻页操作。Specifically, the electronic device is pre-configured with a method for giving action judgment according to the action classification result, which is called a decision strategy here. The electronic device combines the action classification result output by the action recognition model and the preset decision strategy for action judgment to determine whether an action has occurred. If it is determined by the decision strategy that a nodding action occurs, the electronic device is controlled to respond to the page turning operation corresponding to the nodding action. Or return operation; if it is determined through the decision strategy that a shaking motion occurs, the electronic device is controlled to respond to a returning operation or a page turning operation corresponding to the shaking motion.
进一步的,还可以根据动作识别模型输出的动作分类结果对预先设置的决策策略进行调整,比如说n个结果中一共出现几次同一个动作类别,或者同一个动作类别连续出现几次,那么就可以调整决策策略为连续出现几次同一个动作类别才判定有动作 发生。Further, the preset decision-making strategy can also be adjusted according to the action classification results output by the action recognition model. For example, the same action category appears several times in the n results, or the same action category appears several times in a row, then The decision-making strategy can be adjusted to determine that an action occurs when the same action category occurs several times in a row.
可选的,在确定是否有动作发生时,还可以检测动作的幅度是否超过预设幅度,若动作的幅度超过预设幅度,则确定有动作发生。比如,将点头动作的幅度设置为面向地面向下运动10度,若检测到用户的头部面向地面向下运动的幅度超过10度,则确定有点头动作发生;若检测到用户头部面向地面向下运动的幅度没有超过10度,则确定没有点头动作发生。相应的,在确定是否有摇头动作发生时,可以检测用户的头部向左或向右运动的幅度是否超过预设的幅度,进而当检测到用户的头部向左或向右运动的幅度超过预设幅度时,可以确定有摇头动作发生;若检测到用户的头部向左或向右运动的幅度没有超过预设幅度,则确定没有摇头动作发生。示例性的,可以预先将摇头动作的预设幅度设置为向左或向右运动40度,当检测到用户的头部向左或向右运动的幅度超过40度时,可以确定有摇头动作发生,如图4所示;若检测到用户的头部向左或向右运动的幅度没有超过40度,则确定没有摇头动作发生,如图5所示。Optionally, when determining whether an action occurs, it may also be detected whether the magnitude of the action exceeds a preset range, and if the magnitude of the action exceeds the preset range, it is determined that an action occurs. For example, the amplitude of the nodding action is set to face the ground downwards by 10 degrees. If it is detected that the user's head is facing the ground downwards by more than 10 degrees, it is determined that the nodding action has occurred; if it is detected that the user's head is facing the ground If the magnitude of the downward movement does not exceed 10 degrees, it is determined that no nodding action occurs. Correspondingly, when determining whether there is a shaking motion, it can be detected whether the left or right movement of the user's head exceeds the preset range, and then when it is detected that the user's head moves left or right more than When the preset amplitude is used, it can be determined that a shaking motion occurs; if it is detected that the amplitude of the left or right movement of the user's head does not exceed the preset amplitude, it is determined that no shaking motion occurs. Exemplarily, the preset amplitude of the shaking motion may be set to move 40 degrees to the left or right in advance, and when it is detected that the amplitude of the user's head moving to the left or right exceeds 40 degrees, it can be determined that a shaking motion occurs. , as shown in Figure 4; if it is detected that the amplitude of the user's head movement to the left or right does not exceed 40 degrees, it is determined that no shaking action occurs, as shown in Figure 5.
本实施例提供的一种动作识别方法,首先获取预设窗口长度的传感器信号数据,然后将传感器信号数据输入动作识别模型,获取动作识别模型输出的动作分类结果,最后若基于动作分类结果确定有动作发生时,控制电子设备响应于与动作对应的操作。通过上述方法,通过动作识别模型对获取到的时间序列上的预设窗口长度的传感器数据进行识别,可以快速准确的识别出是否有动作发生,提高了动作识别的准确性,进而可以在确定了有动作发生时,控制电子设备实时响应于与该动作对应的操作。In an action recognition method provided in this embodiment, sensor signal data with a preset window length is first obtained, then the sensor signal data is input into an action recognition model, and an action classification result output by the action recognition model is obtained. When an action occurs, the control electronics respond to an operation corresponding to the action. Through the above method, the sensor data of the preset window length in the acquired time series can be identified through the action recognition model, so that whether there is an action can be quickly and accurately identified, the accuracy of the action identification can be improved, and then it can be determined When an action occurs, the control electronic device responds to the operation corresponding to the action in real time.
请参阅图6,本申请实施例提供的一种动作识别方法,应用于电子设备,所述方法包括:Referring to FIG. 6, an action recognition method provided by an embodiment of the present application is applied to an electronic device, and the method includes:
S210:当检测到指定事件时,开始获取头戴设备中的加速度传感器采集的传感器数据,以获取到时间序列上的预设窗口长度的传感器数据。S210: When a designated event is detected, start acquiring sensor data collected by an acceleration sensor in the head mounted device, so as to acquire sensor data with a preset window length on a time series.
作为一种方式,所述传感器数据为头戴设备处于佩戴状态时,头戴设备内置的加速度传感器采集的传感器数据。其中,所述头戴设备为无线蓝牙耳机;所述加速度传感器为三轴加速度传感器。In one way, the sensor data is sensor data collected by an acceleration sensor built in the head-mounted device when the head-mounted device is in a wearing state. Wherein, the head-mounted device is a wireless Bluetooth headset; the acceleration sensor is a three-axis acceleration sensor.
因此,在获取时间序列上的预设窗口长度的传感器数据之前,需要检测无线蓝牙耳机是否处于佩戴状态,可选的,可以通过无线蓝牙耳机内设置的红外传感器来检测无线蓝牙耳机是否处于佩戴状态。其中,需要说明的是,当无线蓝牙耳机被佩戴在人耳朵上时,有部分区域是会被遮挡的,那么在这种情况下,可以在无线蓝牙耳机处于佩戴状态后会被遮挡的区域设置红外传感器的方式,进而可以通过检测红外传感器返回的状态值来确定红外传感器发射的红外信号是否被遮挡,从而可以确定无线蓝牙耳机是否处于佩戴状态还是未佩戴状态。那么可以理解的是,当返回的状态值表征红外信号被遮挡时,确定无线蓝牙耳机处于佩戴状态,当返回的状态值表征红外信号未被遮挡时,确定无线蓝牙耳机处于未佩戴状态。Therefore, before acquiring the sensor data of the preset window length in the time series, it is necessary to detect whether the wireless Bluetooth headset is in a wearing state. Optionally, the infrared sensor set in the wireless Bluetooth headset can be used to detect whether the wireless Bluetooth headset is in a wearing state. . Among them, it should be noted that when the wireless Bluetooth headset is worn on the human ear, some areas will be blocked. In this case, the area that will be blocked after the wireless Bluetooth headset is worn can be set. The method of infrared sensor, and then can determine whether the infrared signal emitted by the infrared sensor is blocked by detecting the state value returned by the infrared sensor, so as to determine whether the wireless Bluetooth headset is in the wearing state or not. Then it can be understood that when the returned state value indicates that the infrared signal is blocked, it is determined that the wireless Bluetooth headset is in a wearing state, and when the returned state value indicates that the infrared signal is not blocked, it is determined that the wireless Bluetooth headset is not wearing.
进一步的,在确定无线蓝牙耳机处于佩戴状态后,还可以获取无线蓝牙耳机的状 态是否处于开机状态。可以的是,无线蓝牙耳机至少可以包括加锁状态、解锁状态、关机状态、开机状态、休眠状态或者其中几种状态的组合,例如,加锁且关机状态、加锁且关机状态、解锁且开机状态、加锁且关机状态等,在此不做限定。具体地,当无线蓝牙耳机处于加锁状态时,表示无线蓝牙耳机的多个功能按键或功能按钮不可操作,以防止用户误触以及防止无线蓝牙耳机未经用户允许被他人使用;当无线蓝牙耳机处于解锁状态时,表示无线蓝牙耳机的多个功能按键或功能按钮可操作,以方便用户对无线蓝牙耳机的功能进行调节,例如,调高音量、调低音量等;当无线蓝牙耳机处于开机状态时,表示无线蓝牙耳机当前可以被使用;当无线蓝牙耳机处于关机状态时,表示无线蓝牙耳机当前不可被使用;当无线蓝牙耳机处于休眠状态时,表示无线蓝牙耳机当前处于待工作状态。Further, after it is determined that the wireless Bluetooth headset is in a wearing state, it is also possible to obtain whether the state of the wireless Bluetooth headset is in a powered-on state. Yes, the wireless Bluetooth headset can at least include a locked state, an unlocked state, an off state, an on state, a sleep state, or a combination of several of these states, for example, a locked and off state, a locked and off state, and an unlocked and powered on state. Status, locked and powered off status, etc., are not limited here. Specifically, when the wireless Bluetooth headset is in a locked state, it means that multiple function keys or function buttons of the wireless Bluetooth headset are inoperable, so as to prevent the user from accidentally touching the wireless Bluetooth headset and to prevent the wireless Bluetooth headset from being used by others without the user's permission; when the wireless Bluetooth headset is inoperable When it is in the unlocked state, it means that multiple function keys or function buttons of the wireless Bluetooth headset can be operated, so as to facilitate the user to adjust the functions of the wireless Bluetooth headset, such as turning up the volume, lowering the volume, etc.; when the wireless Bluetooth headset is turned on , it means that the wireless bluetooth headset can currently be used; when the wireless bluetooth headset is turned off, it means that the wireless bluetooth headset is currently unavailable; when the wireless bluetooth headset is in a dormant state, it means that the wireless bluetooth headset is currently in a standby state.
在本申请实施例中,可以通过检测无线蓝牙耳机的多个功能按键或者功能按钮是否处于正常工作状态,或者检测无线蓝牙耳机当前是否可以被使用来确定无线蓝牙耳机是否处于开机状态。具体的,若检测到无线蓝牙耳机的多个功能按键或功能按钮处于正常工作状态,或者检测到无线蓝牙耳机当前可以被使用,则确定无线蓝牙耳机处于开机状态。In this embodiment of the present application, it can be determined whether the wireless bluetooth headset is powered on by detecting whether multiple function keys or function buttons of the wireless bluetooth headset are in a normal working state, or whether the wireless bluetooth headset can currently be used. Specifically, if it is detected that multiple function keys or function buttons of the wireless Bluetooth headset are in a normal working state, or it is detected that the wireless Bluetooth headset can currently be used, it is determined that the wireless Bluetooth headset is in a powered-on state.
通过上述方法,确定无线蓝牙耳机处于开机状态并处于佩戴状态后,可以检测是否有指定事件发生,当检测到指定事件时,开始获取无线蓝牙耳机中内置的加速度传感器采集传感器数据,以获取时间序列上的预设窗口长度的传感器数据。其中,所述指定事件可以为预先设置的任意可以触发无线蓝牙耳机向电子设备发送传感器数据的事件,比如来电事件。Through the above method, after it is determined that the wireless Bluetooth headset is turned on and worn, it can detect whether a specified event occurs, and when a specified event is detected, the built-in acceleration sensor in the wireless Bluetooth headset starts to collect sensor data to obtain a time series. sensor data over a preset window length. The specified event may be any preset event that can trigger the wireless Bluetooth headset to send sensor data to the electronic device, such as an incoming call event.
可选的,可以通过检测电子设备是否接收到特定的标识来确定是否检测到指定事件。具体的,可以预先为不同的事件设置不同的标识,当有事件发生时,电子设备会先收到与事件对应的标识,进而可以通过检测是否接收到标识来判断是否检测到事件,若接收到标识,则确定检测到事件。进一步的,电子设备可以对接收到的标识进行识别,以通过识别标识来确定是否检测到指定事件。Optionally, whether the specified event is detected may be determined by detecting whether the electronic device receives a specific identification. Specifically, different identifiers can be set for different events in advance. When an event occurs, the electronic device will first receive the identifier corresponding to the event, and then can determine whether the event is detected by detecting whether the identifier is received. flag, it is determined that an event has been detected. Further, the electronic device may identify the received identification, so as to determine whether the specified event is detected by identifying the identification.
示例性的,预先为不同的事件设置不同的标识,其中,事件可以包括来电事件、开机事件以及消息接收事件等,具体的,与上述事件对应的标识可以分别设置为:来电事件对应的标识设置为LDSJ,开机事件对应的标识设置为KJSJ,消息接收事件对应的标识设置为XXSJ。进而可以预先在电子设备设置指定事件,当检测到指定事件后,开始获取无线蓝牙耳机中的加速度传感器采集的传感器数据。比如,预先将指定事件设置为来电事件,进而当电子设备接收到标识确定有事件发生后开始对该标识进行识别,若识别出该标识为LDSJ,则确定检测到指定事件。Exemplarily, different identifiers are set for different events in advance, wherein the events may include incoming call events, power-on events, and message receiving events, etc. Specifically, the identifiers corresponding to the above events may be respectively set as: the identifiers corresponding to the incoming call events are set. For LDSJ, the logo corresponding to the boot event is set to KJSJ, and the logo corresponding to the message receiving event is set to XXSJ. Furthermore, a specified event can be set in the electronic device in advance, and when the specified event is detected, the sensor data collected by the acceleration sensor in the wireless Bluetooth headset is started to be acquired. For example, the specified event is set as an incoming call event in advance, and then when the electronic device receives the identification and determines that an event has occurred, it starts to identify the identification. If the identification is identified as LDSJ, it is determined that the specified event is detected.
在本申请实施例中,由于加速度传感器为三轴加速度传感器,所以在无线蓝牙耳机处于开机状态且处于佩戴状态的情况下,当检测到指定事件时,无线蓝牙耳机内置的三轴加速度传感器开始从X、Y、Z三个维度采集传感器数据。In the embodiment of the present application, since the acceleration sensor is a three-axis acceleration sensor, when the wireless Bluetooth headset is turned on and in the wearing state, when a specified event is detected, the built-in three-axis acceleration sensor of the wireless Bluetooth headset starts to Sensor data is collected in three dimensions: X, Y, and Z.
S220:获取待输入数据的值,所述待输入数据的值为所述预设窗口长度的传感器数据的所有通道的值的平均值。S220: Acquire a value of the data to be input, where the value of the data to be input is an average value of the values of all channels of the sensor data of the preset window length.
通过上述方法,获取到时间序列上的预设窗口长度的传感器数据后,获取预设窗口长度的传感器数据中每个传感器数据的X、Y、Z三个通道的值的平均值,将预设窗口长度的传感器数据中每个传感器数据的X、Y、Z三个通道的值的平均值作为待输入数据的值,其中,所述待输入数据为需要输入到动作识别模型中进行动作识别的传感器数据。Through the above method, after acquiring the sensor data of the preset window length on the time series, the average value of the values of the X, Y, and Z channels of each sensor data in the sensor data of the preset window length is acquired, and the preset window length is obtained. The average value of the values of the X, Y, and Z channels of each sensor data in the sensor data of the window length is used as the value of the data to be input, wherein the data to be input needs to be input into the action recognition model for action recognition. sensor data.
S230:将所述待输入数据的值输入所述动作识别模型,获取所述动作识别模型输出的动作分类结果。S230: Input the value of the data to be input into the action recognition model, and obtain the action classification result output by the action recognition model.
作为一种方式,将通过上述方式获取到的待输入数据的值输入动作识别模型,通过动作识别模型对待输入数据进行识别,进而可以获取动作识别模型输出的动作分类结果。As a method, the value of the data to be input obtained by the above method is input into the action recognition model, and the input data to be recognized is recognized by the action recognition model, and then the action classification result output by the action recognition model can be obtained.
S240:将所述待输入数据进行归零处理。S240: Perform zero return processing on the data to be input.
作为一种方式,为了避免之前的传感器数据对后面输入动作识别模型的传感器数据产生影响,在将待输入数据的值输入动作识别模型后,将待输入数据进行归零的预处理。As a method, in order to avoid previous sensor data from affecting the sensor data input to the action recognition model later, after the value of the data to be input is input into the action recognition model, the data to be input is preprocessed to zero.
在下一次再获取到待输入数据时,再将待输入数据的值设置为当前获取到的预设窗口长度的传感器数据的所有通道的值的平均值,得到新的待输入数据,再将新的待输入数据输入到动作识别模型进行动作识别操作,上述操作在动作识别过程中重复进行。When the data to be input is obtained next time, the value of the data to be input is set as the average value of all channels of the sensor data of the preset window length currently obtained to obtain the new data to be input, and then the new data to be input is obtained. The input data is input into the action recognition model to perform the action recognition operation, and the above operations are repeated during the action recognition process.
本实施例提供的一种动作识别方法,当检测到指定事件时,开始获取头戴设备中的加速度传感器采集的传感器数据,以获取到时间序列上的预设窗口长度的传感器数据,然后获取待输入数据的值,所述待输入数据的值为预设窗口长度的传感器所有通道的值的平均值,将待输入数据的值输入动作识别模型,获取动作识别模型输出的动作分类结果,再将待输入数据进行归零处理。通过上述方法,在将待输入数据的值输入动作识别模型进行动作识别后将待输入数据进行归零处理,可以避免之前的动作识别结果对下一次动作识别产生影响,提高了动作识别的准确性。In an action recognition method provided in this embodiment, when a specified event is detected, sensor data collected by an acceleration sensor in a head-mounted device is started to be obtained, so as to obtain sensor data of a preset window length in a time series, and then the sensor data of a preset window length in a time series is obtained. The value of the input data, the value of the data to be input is the average value of the values of all channels of the sensor with the preset window length, the value of the data to be input is input into the action recognition model, the action classification result output by the action recognition model is obtained, and then the The data to be input is reset to zero. Through the above method, after the value of the data to be input is input into the action recognition model for action recognition, the data to be input is reset to zero, which can avoid the influence of the previous action recognition result on the next action recognition, and improve the accuracy of the action recognition .
请参阅图7,本申请实施例提供的一种动作识别方法,应用于电子设备,所述方法包括:Referring to FIG. 7 , an action recognition method provided by an embodiment of the present application is applied to an electronic device, and the method includes:
S310:当检测到指定事件时,开始获取头戴设备中的加速度传感器采集的传感器数据,以获取到时间序列上的预设窗口长度的传感器数据。S310: When a specified event is detected, start acquiring sensor data collected by an acceleration sensor in the head-mounted device, so as to acquire sensor data with a preset window length on a time series.
作为一种方式,所述指定事件为来电事件;所述时间序列上的预设窗口长度的传感器数据可以为头戴设备中的加速度传感器按照时间顺序采集到预设窗口长度的传感器数据后,再将预设窗口长度的传感器数据发送给电子设备,也可以为头戴设备中的加速度传感器实时采集然后实时向电子设备发送的传感器数据,电子设备在存储到满 足预设窗口长度时,将预设窗口长度的传感器数据输入给动作识别模型。In one way, the specified event is an incoming call event; the sensor data of the preset window length on the time series may be the sensor data of the preset window length collected by the acceleration sensor in the head-mounted device in the time sequence, and then Send the sensor data of the preset window length to the electronic device, or the sensor data collected in real time for the acceleration sensor in the headset and then sent to the electronic device in real time. When the electronic device stores the preset window length to meet the preset window length, the preset The sensor data of the window length is input to the action recognition model.
可选的,在本申请实施例中,一个时刻可以对应一个传感器数据。在电子设备对头戴设备发送的传感器数据进行存储时,为了降低传感器数据维度,可以在头戴设备发送的时间序列上的传感器数据中每隔四个时间点取一个时间点对应的传感器数据进行存储,或者每隔四个传感器数据点取一个传感器数据进行存储。示例性的,头戴设备按照时间顺序发送的传感器数据为“数据1,数据2,数据3,数据4,数据5,数据6,数据7,数据8,数据9,数据10”,电子设备按照每隔四个传感器数据点取一个传感器数据进行存储的方式,存储的传感器数据有“数据1,数据5以及数据9”。Optionally, in this embodiment of the present application, one moment may correspond to one sensor data. When the electronic device stores the sensor data sent by the head-mounted device, in order to reduce the dimension of the sensor data, the sensor data corresponding to one time point can be selected every four time points from the sensor data in the time series sent by the head-mounted device. Store, or take every four sensor data points to store sensor data. Exemplarily, the sensor data sent by the head mounted device in chronological order is "data 1, data 2, data 3, data 4, data 5, data 6, data 7, data 8, data 9, data 10", and the electronic device according to The method of taking one sensor data every four sensor data points for storage, the stored sensor data includes "data 1, data 5 and data 9".
S320:将所述传感器数据输入动作识别模型,获取所述动作识别模型输出的动作分类结果。S320: Input the sensor data into an action recognition model, and obtain an action classification result output by the action recognition model.
S320所包括的步骤的详细解释可以参照前述实施例中的对应步骤,这里不再赘述。For a detailed explanation of the steps included in S320, reference may be made to the corresponding steps in the foregoing embodiments, which will not be repeated here.
S330:将所述动作分类结果保存在预设长度的结果队列中,以对所述结果队列进行更新。S330: Save the action classification result in a result queue with a preset length to update the result queue.
作为一种方式,所述预设长度的结果队列为预先设置的长度为n的保存动作识别模型输出的动作分类结果的队列。该预设长度的结果队列中保存了此前n个预设窗口长度的传感器数据的推理结果。In one way, the result queue of preset length is a preset queue of length n that stores the action classification results output by the action recognition model. The inference results of the sensor data of the previous n preset window lengths are stored in the result queue of the preset length.
在本申请实施例中,对结果队列进行更新可以理解为一旦检测到有新的动作分类结果存储在结果队列中时就是对结果队列进行更新。In this embodiment of the present application, updating the result queue can be understood as updating the result queue once it is detected that a new action classification result is stored in the result queue.
进一步的,由于结果队列的长度为固定的长度,因此在存储满了n个动作分类结果后,再存储新的动作分类结果时,就需要将结果队列中原本存储的动作分类结果进行移除,在上述情况下,可以基于先进先出的原则对结果队列中原本存储的动作分类结果进行移除。在将结果队列中原本存储的动作分类结果移除后,再将新的动作分类结果存储在结果队列中。Further, since the length of the result queue is a fixed length, after storing n action classification results, when a new action classification result is stored, it is necessary to remove the originally stored action classification result in the result queue. In the above case, the action classification results originally stored in the result queue can be removed based on the first-in-first-out principle. After the action classification result originally stored in the result queue is removed, the new action classification result is stored in the result queue.
S340:基于更新后的结果队列判断是否有动作发生。S340: Determine whether an action occurs based on the updated result queue.
在本申请实施例中,通过上述方法对结果队列进行更新后,每一次结果队列发生更新时,都会根据决策策略去判断是否有动作发生。当决策策略判定有动作发生时,电子设备会做一定的冷却处理,电子设备会将新生成的动作分类结果的值设置为预设值,防止电子设备因检测同一个动作而做出连续响应。In the embodiment of the present application, after the result queue is updated by the above method, every time the result queue is updated, it is determined whether an action occurs according to the decision policy. When the decision-making strategy determines that an action occurs, the electronic device will perform a certain cooling process, and the electronic device will set the value of the newly generated action classification result to a preset value to prevent the electronic device from responding continuously due to the detection of the same action.
具体的,在将新生成的动作分类结果的值设置为预设值的步骤可以包括:若基于所述更新后的结果队列中的多个分类结果判断有指定动作发生时,将新生成的分类结果置为预设值后存入所述结果队列;或者,若基于所述更新后的结果队列中的多个分类结果判断有指定动作发生时,将新生成的分类结果存入所述结果队列后置为预设值。其中,所述预设值可以为设置的任意值。Specifically, the step of setting the value of the newly generated action classification result as a preset value may include: if it is determined that a specified action occurs based on multiple classification results in the updated result queue, classifying the newly generated classification result as a preset value. The result is set to a preset value and stored in the result queue; or, if it is judged that a specified action occurs based on multiple classification results in the updated result queue, the newly generated classification result is stored in the result queue. Rear is the default value. Wherein, the preset value may be any set value.
进一步的,如果根据决策策略判断没有动作发生时,为了避免之前的动作识别结果对下一次动作识别产生影响,电子设备将等待一段时间后再将下一个预设窗口长度 的传感器数据输入动作识别模型。Further, if it is judged that no action occurs according to the decision-making strategy, in order to avoid the impact of the previous action recognition result on the next action recognition, the electronic device will wait for a period of time before inputting the sensor data of the next preset window length into the action recognition model. .
S350:当确定有点头动作发生时,控制所述电子设备响应于与所述点头动作对应的接入来电操作。S350: When it is determined that a nodding action occurs, control the electronic device to operate in response to an incoming incoming call corresponding to the nodding action.
作为一种方式,当根据决策策略判断有点头动作发生时,控制电子设备调用接口来响应于与点头动作对应的接入来电操作。In one way, when it is determined that the nodding action occurs according to the decision policy, the control electronic device invokes the interface to respond to the incoming incoming call operation corresponding to the nodding action.
S360:当确定有摇头动作发生时,控制所述电子设备响应于与所述摇头动作对应的拒绝来电操作。S360: When it is determined that a shaking motion occurs, control the electronic device to respond to an operation of rejecting an incoming call corresponding to the shaking motion.
作为一种方式,当根据决策策略判断有摇头动作发生时,控制电子设备响应于与摇头动作对应的拒绝来电操作。In one way, when it is judged that a shaking motion occurs according to the decision policy, the control electronic device responds to an operation of rejecting the incoming call corresponding to the shaking motion.
通过上述方法,用户通过点头动作和摇头动作来控制接听电话和拒接电话,解决了用户在某些场景下不方便用手进行触屏操作的问题,其中应用场景可以包括以下场景:对远距离设备进行操作的应用场景,双手被其他事务占领的应用场景,以及残障人士或不方便进行语音操作时的私密交互的应用场景等。Through the above method, the user controls answering and rejecting calls by nodding and shaking his head, which solves the problem that the user is inconvenient to touch the screen in some scenarios. The application scenarios may include the following scenarios: Application scenarios of device operation, application scenarios of hands being occupied by other affairs, and application scenarios of private interaction when disabled or inconvenient to perform voice operations, etc.
本实施例提供的一种动作识别方法,获取时间序列上的预设窗口长度的传感器数据,将传感器数据输入动作识别模型,获取动作识别模型输出的动作分类结果,再将动作分类结果保存在预设长度的结果队列中,以对结果队列进行更新,然后基于更新后的结果队列判断是否有动作发生,当确定有点头动作发生时,控制电子设备响应于与点头动作对应的接入来电操作,当确定有摇头动作发生时,控制电子设备响应于与摇头动作对应放入拒绝来电操作。通过上述方法,在用户与各电子设备的交互中,用户可以通过点头动作来接听电话,通过摇头动作来拒接电话,解决了用于在某些场景下不方便用手进行触屏操作的痛点,为用户提供了一种全新的基础性交互方式,也为用户多提供了一种交互选择,同时有助于提升各电子设备的表现力,提升了用户体验。In an action recognition method provided in this embodiment, sensor data with a preset window length in a time series is acquired, the sensor data is input into an action recognition model, an action classification result output by the action recognition model is obtained, and the action classification result is stored in a preset Set the length of the result queue to update the result queue, and then judge whether there is an action based on the updated result queue. When it is determined that the nodding action occurs, the control electronic device responds to the nodding action corresponding to the incoming incoming call operation, When it is determined that a shaking motion occurs, the control electronic device puts in an operation of rejecting the incoming call in response to the shaking motion. Through the above method, in the interaction between the user and each electronic device, the user can answer the call by nodding, and reject the call by shaking his head, which solves the inconvenient hand touch screen operation in some scenarios. , which provides users with a brand-new basic interaction method, and also provides users with one more interactive choice, which at the same time helps to improve the expressiveness of each electronic device and enhance the user experience.
请参阅图8,本申请提供的一种动作识别装置400,运行于电子设备,所述装置400包括:数据获取单元410、结果输出单元420以及操作执行单元430。Referring to FIG. 8 , a motion recognition apparatus 400 provided by the present application operates on an electronic device, and the apparatus 400 includes a data acquisition unit 410 , a result output unit 420 and an operation execution unit 430 .
数据获取单元410,用于获取时间序列上的预设窗口长度的传感器数据。The data acquisition unit 410 is configured to acquire sensor data with a preset window length on a time series.
具体的,所述数据获取单元410用于当检测到指定事件时,开始获取头戴设备中的加速度传感器采集的传感器数据,以获取到时间序列上的预设窗口长度的传感器数据。Specifically, the data acquisition unit 410 is configured to start acquiring sensor data collected by an acceleration sensor in the headset when a specified event is detected, so as to acquire sensor data with a preset window length on a time series.
作为一种方式,所述数据获取单元410还用于获取所述电子设备与头戴设备的连接状态;若所述电子设备与所述头戴设备处于连接状态,获取时间序列上的预设窗口长度的传感器数据。In one way, the data acquisition unit 410 is further configured to acquire the connection state between the electronic device and the head-mounted device; if the electronic device and the head-mounted device are in the connection state, acquire a preset window in the time series length of sensor data.
可选的,所述数据获取单元410具体还用于获取所述电子设备的状态值;若所述电子设备的状态值为第一状态值,确定所述电子设备与所述头戴设备处于连接状态;若所述电子设备的状态值为第二状态值,确定所述电子设备与所述头戴设备处于未连接状态。Optionally, the data acquisition unit 410 is further configured to acquire the state value of the electronic device; if the state value of the electronic device is the first state value, it is determined that the electronic device is connected to the head-mounted device. state; if the state value of the electronic device is the second state value, it is determined that the electronic device and the head mounted device are in a disconnected state.
可选的,所述数据获取单元410还用于通过监听广播的方式,确定所述电子设备与所述头戴设备的连接状态。Optionally, the data acquisition unit 410 is further configured to determine the connection state between the electronic device and the head-mounted device by monitoring broadcasts.
作为另一种方式,所述数据获取单元410用于检测所述无线蓝牙耳机是否处于佩戴状态;若所述无线蓝牙耳机处于佩戴状态,获取所述时间序列上的预设窗口长度的传感器数据。As another method, the data acquisition unit 410 is configured to detect whether the wireless Bluetooth headset is in a wearing state; if the wireless Bluetooth headset is in a wearing state, acquire sensor data of a preset window length on the time series.
可选的,所述数据获取单元410还用于获取所述无线蓝牙耳机内设置的红外传感器返回的状态值;若所述状态值表征所述红外传感器发射的红外信号被遮挡,确定所述无线蓝牙耳机处于佩戴状态;若所述状态值表征所述红外传感器发射的红外信号未被遮挡,确定所述无线蓝牙耳机处于未佩戴状态。Optionally, the data acquisition unit 410 is further configured to acquire the status value returned by the infrared sensor set in the wireless Bluetooth headset; if the status value indicates that the infrared signal emitted by the infrared sensor is blocked, it is determined that the wireless The Bluetooth headset is in a wearing state; if the state value indicates that the infrared signal emitted by the infrared sensor is not blocked, it is determined that the wireless Bluetooth headset is in an unworn state.
可选的,所述数据获取单元410还用于若所述无线蓝牙耳机处于佩戴状态,检测所述无线蓝牙耳机是否处于开机状态;当确定所述无线蓝牙耳机处于佩戴状态且处于开机状态时,获取时间序列上的预设窗口长度的传感器数据。Optionally, the data acquisition unit 410 is further configured to detect whether the wireless Bluetooth headset is in a powered-on state if the wireless Bluetooth headset is in a wearing state; when it is determined that the wireless Bluetooth headset is in a wearing state and is in a powered-on state, Acquire sensor data for a preset window length over a time series.
可选的,所述数据获取单元410还用于检测所述无线蓝牙耳机的多个功能按键或者功能按钮是否处于正常工作状态;若检测到所述无线蓝牙耳机的多个功能按键或者功能按钮处于正常工作状态,确定所述无线蓝牙耳机处于开机状态。Optionally, the data acquisition unit 410 is further configured to detect whether multiple function keys or function buttons of the wireless Bluetooth headset are in a normal working state; if it is detected that multiple function keys or function buttons of the wireless Bluetooth headset are in In a normal working state, it is determined that the wireless Bluetooth headset is in a powered-on state.
可选的,所述数据获取单元410还用于检测所述无线蓝牙耳机当前是否可以被使用;若检测到所述无线蓝牙耳机当前可以被使用,确定所述无线蓝牙耳机处于开机状态。Optionally, the data acquisition unit 410 is further configured to detect whether the wireless Bluetooth headset can currently be used; if it is detected that the wireless Bluetooth headset can currently be used, it is determined that the wireless Bluetooth headset is in a powered-on state.
结果输出单元420,用于将所述传感器数据输入动作识别模型,获取所述动作识别模型输出的动作分类结果。The result output unit 420 is configured to input the sensor data into the action recognition model, and obtain the action classification result output by the action recognition model.
其中,所述动作识别模型包括依次连接的第一卷积层,第二卷积层,最大池化层,第三卷积层,第四卷积层,全局平均池化层,全连层以及softmax层;所述第一卷积层和所述第二卷积层为卷积核为7且维度为64的卷积层;所述第三卷积层和所述第四卷积层为卷积核为7且维度为128的卷积层。The action recognition model includes a first convolutional layer, a second convolutional layer, a maximum pooling layer, a third convolutional layer, a fourth convolutional layer, a global average pooling layer, a fully connected layer and softmax layer; the first convolutional layer and the second convolutional layer are convolutional layers with a convolution kernel of 7 and a dimension of 64; the third convolutional layer and the fourth convolutional layer are volumes A convolutional layer with a kernel of 7 and a dimension of 128.
基于交叉熵损失函数,对卷积神经网络模型进行迭代训练,直至迭代次数达到最大迭代次数,将达到最大迭代次数时的卷积神经网络模型作为所述动作识别模型。Based on the cross-entropy loss function, the convolutional neural network model is iteratively trained until the number of iterations reaches the maximum number of iterations, and the convolutional neural network model when the maximum number of iterations is reached is used as the action recognition model.
其中,所述交叉熵损失函数为
Figure PCTCN2021139746-appb-000002
其中:所述M表征动作类别的数量;所述y ic表征指示变量;所述p ic表征对于观测样本i属于动作类别c的预测概率。
Wherein, the cross entropy loss function is
Figure PCTCN2021139746-appb-000002
Wherein: the M represents the number of action categories; the y ic represents the indicator variable; the pic represents the predicted probability that the observed sample i belongs to the action category c.
具体的,所述结果输出单元420用于获取待输入数据的值,所述待输入数据的值为所述预设窗口长度的传感器数据的所有通道的值的平均值;将所述待输入数据的值输入所述动作识别模型,获取所述动作识别模型输出的动作分类结果;将所述待输入数据进行归零处理。Specifically, the result output unit 420 is configured to obtain the value of the data to be input, where the value of the data to be input is the average value of the values of all channels of the sensor data of the preset window length; The value of the action recognition model is input to the action recognition model, and the action classification result output by the action recognition model is obtained; the data to be input is zeroed.
操作执行单元430,用于若基于所述动作分类结果确定有动作发生时,控制所述电 子设备响应于与所述动作对应的操作。The operation execution unit 430 is configured to control the electronic device to respond to an operation corresponding to the action if it is determined that an action occurs based on the action classification result.
具体的,所述操作执行单元430用于当确定有点头动作发生时,控制所述电子设备响应于与所述点头动作对应的接入来电操作;当确定有摇头动作发生时,控制所述电子设备响应于与所述摇头动作对应的拒绝来电操作。Specifically, the operation execution unit 430 is configured to control the electronic device to respond to an incoming call operation corresponding to the nodding action when it is determined that a nodding action occurs; when it is determined that a shaking action occurs, control the electronic device The device responds to an operation of rejecting the incoming call corresponding to the shaking motion.
请参阅图9,所述装置400还包括:Referring to FIG. 9, the apparatus 400 further includes:
动作判断单元440,用于将所述动作分类结果保存在预设长度的结果队列中,以对所述结果队列进行更新;基于更新后的结果队列判断是否有动作发生。The action judging unit 440 is configured to save the action classification result in a result queue of preset length, so as to update the result queue; and determine whether an action occurs based on the updated result queue.
请参阅图10,所述装置400还包括:Referring to FIG. 10, the apparatus 400 further includes:
结果保存单元450,用于若基于所述更新后的结果队列中的多个分类结果判断有指定动作发生时,将新生成的分类结果置为预设值后存入所述结果队列;或者,若基于所述更新后的结果队列中的多个分类结果判断有指定动作发生时,将新生成的分类结果存入所述结果队列后置为预设值。The result saving unit 450 is configured to set the newly generated classification result as a preset value and store it in the result queue if it is judged that a specified action occurs based on a plurality of classification results in the updated result queue; or, If it is determined that a specified action occurs based on the plurality of classification results in the updated result queue, the newly generated classification result is stored in the result queue and set as a preset value.
需要说明的是,本申请中装置实施例与前述方法实施例是相互对应的,装置实施例中具体的原理可以参见前述方法实施例中的内容,此处不再赘述。It should be noted that the apparatus embodiments in the present application correspond to the foregoing method embodiments, and the specific principles in the apparatus embodiments may refer to the content in the foregoing method embodiments, which will not be repeated here.
下面将结合图11对本申请提供的一种电子设备进行说明。An electronic device provided by the present application will be described below with reference to FIG. 11 .
请参阅图11,基于上述的动作识别方法、装置,本申请实施例还提供的另一种可以执行前述动作识别方法的电子设备100。电子设备100包括相互耦合的一个或多个(图中仅示出一个)处理器102、存储器104以及网络模块106。其中,该存储器104中存储有可以执行前述实施例中内容的程序,而处理器102可以执行该存储器104中存储的程序。Referring to FIG. 11 , based on the above-mentioned action recognition method and apparatus, an embodiment of the present application further provides another electronic device 100 that can execute the foregoing action recognition method. The electronic device 100 includes one or more (only one shown in the figure) a processor 102, a memory 104, and a network module 106 that are coupled to each other. Wherein, the memory 104 stores a program that can execute the content in the foregoing embodiments, and the processor 102 can execute the program stored in the memory 104 .
其中,处理器102可以包括一个或者多个处理核。处理器102利用各种接口和线路连接整个电子设备100内的各个部分,通过运行或执行存储在存储器104内的指令、程序、代码集或指令集,以及调用存储在存储器104内的数据,执行电子设备100的各种功能和处理数据。可选地,处理器102可以采用数字信号处理(Digital Signal Processing,DSP)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)、可编程逻辑阵列(Programmable Logic Array,PLA)中的至少一种硬件形式来实现。处理器102可集成中央处理器(Central Processing Unit,CPU)、图像处理器(Graphics Processing Unit,GPU)和调制解调器等中的一种或几种的组合。其中,CPU主要处理操作系统、用户界面和应用程序等;GPU用于负责显示内容的渲染和绘制;调制解调器用于处理无线通信。可以理解的是,上述调制解调器也可以不集成到处理器102中,单独通过一块通信芯片进行实现。The processor 102 may include one or more processing cores. The processor 102 uses various interfaces and lines to connect various parts of the entire electronic device 100, and executes by running or executing the instructions, programs, code sets or instruction sets stored in the memory 104, and calling the data stored in the memory 104. Various functions of the electronic device 100 and processing data. Optionally, the processor 102 may adopt at least one of digital signal processing (Digital Signal Processing, DSP), field-programmable gate array (Field-Programmable Gate Array, FPGA), and programmable logic array (Programmable Logic Array, PLA). implemented in a hardware form. The processor 102 may integrate one or a combination of a central processing unit (Central Processing Unit, CPU), a graphics processing unit (Graphics Processing Unit, GPU), a modem, and the like. Among them, the CPU mainly handles the operating system, user interface and application programs, etc.; the GPU is used for rendering and drawing of the display content; the modem is used to handle wireless communication. It can be understood that, the above-mentioned modem may not be integrated into the processor 102, and is implemented by a communication chip alone.
存储器104可以包括随机存储器(Random Access Memory,RAM),也可以包括只读存储器(Read-Only Memory,ROM)。存储器104可用于存储指令、程序、代码、代码集或指令集。存储器104可包括存储程序区和存储数据区,其中,存储程序区可存储用于实现操作系统的指令、用于实现至少一个功能的指令(比如触控功能、声音播放功能、图像播放功能等)、用于实现下述各个方法实施例的指令等。例如,存储器104中可以存储有动作识别的装置。该设动作识别的装置可以为前述的装置400。存储数据区还可以存储电 子设备100在使用中所创建的数据(比如电话本、音视频数据、聊天记录数据)等。The memory 104 may include a random access memory (Random Access Memory, RAM), or may include a read-only memory (Read-Only Memory, ROM). Memory 104 may be used to store instructions, programs, codes, sets of codes, or sets of instructions. The memory 104 may include a stored program area and a stored data area, wherein the stored program area may store instructions for implementing the operating system, instructions for implementing at least one function (such as a touch function, a sound playback function, an image playback function, etc.) , instructions for implementing the following method embodiments, and the like. For example, the memory 104 may store motion recognition means. The device for setting motion recognition may be the aforementioned device 400 . The storage data area can also store data (such as phone book, audio and video data, chat record data) created by the electronic device 100 during use.
所述网络模块106用于接收以及发送电磁波,实现电磁波与电信号的相互转换,从而与通讯网络或者其他设备进行通讯,例如和音频播放设备进行通讯。所述网络模块106可包括各种现有的用于执行这些功能的电路元件,例如,天线、射频收发器、数字信号处理器、加密/解密芯片、用户身份模块(SIM)卡、存储器等等。所述网络模块106可与各种网络如互联网、企业内部网、无线网络进行通讯或者通过无线网络与其他设备进行通讯。上述的无线网络可包括蜂窝式电话网、无线局域网或者城域网。例如,网络模块106可以与基站进行信息交互。The network module 106 is used for receiving and sending electromagnetic waves, realizing mutual conversion between electromagnetic waves and electrical signals, so as to communicate with a communication network or other devices, for example, communicate with an audio playback device. The network module 106 may include various existing circuit elements for performing these functions, eg, antennas, radio frequency transceivers, digital signal processors, encryption/decryption chips, subscriber identity module (SIM) cards, memory, etc. . The network module 106 can communicate with various networks such as the Internet, an intranet, a wireless network, or communicate with other devices through a wireless network. The aforementioned wireless network may include a cellular telephone network, a wireless local area network, or a metropolitan area network. For example, the network module 106 may interact with the base station for information.
请参考图12,其示出了本申请实施例提供的一种计算机可读存储介质的结构框图。该计算机可读存储介质800中存储有程序代码,所述程序代码可被处理器调用执行上述方法实施例中所描述的方法。Please refer to FIG. 12 , which shows a structural block diagram of a computer-readable storage medium provided by an embodiment of the present application. The computer-readable storage medium 800 stores program codes, and the program codes can be invoked by the processor to execute the methods described in the above method embodiments.
计算机可读存储介质800可以是诸如闪存、EEPROM(电可擦除可编程只读存储器)、EPROM、硬盘或者ROM之类的电子存储器。可选地,计算机可读存储介质800包括非易失性计算机可读介质(non-transitory computer-readable storage medium)。计算机可读存储介质800具有执行上述方法中的任何方法步骤的程序代码810的存储空间。这些程序代码可以从一个或者多个计算机程序产品中读出或者写入到这一个或者多个计算机程序产品中。程序代码810可以例如以适当形式进行压缩。The computer readable storage medium 800 may be an electronic memory such as flash memory, EEPROM (Electrically Erasable Programmable Read Only Memory), EPROM, hard disk, or ROM. Optionally, the computer-readable storage medium 800 includes a non-transitory computer-readable storage medium. Computer readable storage medium 800 has storage space for program code 810 to perform any of the method steps in the above-described methods. These program codes can be read from or written to one or more computer program products. Program code 810 may be compressed, for example, in a suitable form.
本实施例提供的一种动作识别方法、装置、电子设备以及存储介质,首先获取预设窗口长度的传感器信号数据,然后将传感器信号数据输入动作识别模型,获取动作识别模型输出的动作分类结果,最后若基于动作分类结果确定有动作发生时,控制电子设备响应于与动作对应的操作。通过上述方法,通过动作识别模型对获取到的时间序列上的预设窗口长度的传感器数据进行识别,可以快速准确的识别出是否有动作发生,提高了动作识别的准确性,进而可以在确定了有动作发生时,控制电子设备实时响应于与该动作对应的操作。In an action recognition method, device, electronic device, and storage medium provided by this embodiment, sensor signal data with a preset window length is first obtained, and then the sensor signal data is input into an action recognition model to obtain an action classification result output by the action recognition model, Finally, if it is determined that an action occurs based on the action classification result, the electronic device is controlled to respond to an operation corresponding to the action. Through the above method, the sensor data of the preset window length in the acquired time series can be recognized by the motion recognition model, so that whether there is an action can be quickly and accurately identified, the accuracy of the action recognition can be improved, and then it can be determined When an action occurs, the control electronic device responds to the operation corresponding to the action in real time.
最后应说明的是:以上实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不驱使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围。Finally, it should be noted that: the above embodiments are only used to illustrate the technical solutions of the present application, but not to limit them; although the present application has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand: it can still be Modifications are made to the technical solutions described in the foregoing embodiments, or some technical features thereof are equivalently replaced; and these modifications or replacements do not drive the essence of the corresponding technical solutions to deviate from the spirit and scope of the technical solutions of the embodiments of the present application.

Claims (20)

  1. 一种动作识别方法,其特征在于,应用于电子设备,所述方法包括:An action recognition method, characterized in that, applied to an electronic device, the method comprising:
    获取时间序列上的预设窗口长度的传感器数据;Obtain sensor data for a preset window length on a time series;
    将所述传感器数据输入动作识别模型,获取所述动作识别模型输出的动作分类结果;Inputting the sensor data into an action recognition model to obtain an action classification result output by the action recognition model;
    若基于所述动作分类结果确定有动作发生时,控制所述电子设备响应于与所述动作对应的操作。If it is determined that an action occurs based on the action classification result, the electronic device is controlled to respond to an operation corresponding to the action.
  2. 根据权利要求1所述的方法,其特征在于,所述获取时间序列上的预设窗口长度的传感器信号数据,包括:The method according to claim 1, wherein the acquiring sensor signal data of a preset window length on a time series comprises:
    当检测到指定事件时,开始获取头戴设备中的加速度传感器采集的传感器数据,以获取到时间序列上的预设窗口长度的传感器数据。When a specified event is detected, the sensor data collected by the acceleration sensor in the head-mounted device is acquired, so as to acquire sensor data with a preset window length on the time series.
  3. 根据权利要求2所述的方法,其特征在于,所述将所述传感器数据输入动作识别模型,获取所述动作识别模型输出的动作分类结果,包括:The method according to claim 2, wherein the inputting the sensor data into an action recognition model, and obtaining an action classification result output by the action recognition model, comprises:
    获取待输入数据的值,所述待输入数据的值为所述预设窗口长度的传感器数据的所有通道的值的平均值;obtaining the value of the data to be input, where the value of the data to be input is the average value of the values of all channels of the sensor data of the preset window length;
    将所述待输入数据的值输入所述动作识别模型,获取所述动作识别模型输出的动作分类结果;Input the value of the data to be input into the action recognition model, and obtain the action classification result output by the action recognition model;
    将所述待输入数据进行归零处理。The data to be input is reset to zero.
  4. 根据权利要求2所述的方法,其特征在于,所述指定事件为来电事件,所述若基于所述动作分类结果确定有动作发生时,控制所述电子设备响应于与所述动作对应的操作,包括:The method according to claim 2, wherein the designated event is an incoming call event, and if it is determined that an action occurs based on the action classification result, the electronic device is controlled to respond to an operation corresponding to the action ,include:
    当确定有点头动作发生时,控制所述电子设备响应于与所述点头动作对应的接入来电操作;When it is determined that a nodding action occurs, controlling the electronic device to operate in response to an incoming incoming call corresponding to the nodding action;
    当确定有摇头动作发生时,控制所述电子设备响应于与所述摇头动作对应的拒绝来电操作。When it is determined that a shaking motion occurs, the electronic device is controlled to respond to an operation of rejecting an incoming call corresponding to the shaking motion.
  5. 根据权利要求1所述的方法,其特征在于,所述若基于所述动作分类结果确定有动作发生时,控制所述电子设备响应于与所述动作对应的操作之前还包括:The method according to claim 1, wherein if it is determined that an action occurs based on the action classification result, before controlling the electronic device to respond to an operation corresponding to the action, the method further comprises:
    将所述动作分类结果保存在预设长度的结果队列中,以对所述结果队列进行更新;saving the action classification result in a result queue of preset length to update the result queue;
    基于更新后的结果队列判断是否有动作发生。Based on the updated result queue, it is determined whether an action has occurred.
  6. 根据权利要求5所述的方法,其特征在于,所述方法还包括:The method according to claim 5, wherein the method further comprises:
    若基于所述更新后的结果队列中的多个分类结果判断有指定动作发生时,将新生成的分类结果置为预设值后存入所述结果队列;或者,If it is determined that a specified action occurs based on multiple classification results in the updated result queue, the newly generated classification result is set to a preset value and then stored in the result queue; or,
    若基于所述更新后的结果队列中的多个分类结果判断有指定动作发生时,将新生 成的分类结果存入所述结果队列后置为预设值。If it is judged that a designated action occurs based on a plurality of classification results in the updated result queue, the newly generated classification result is stored in the result queue and set as a preset value.
  7. 根据权利要求1所述的方法,其特征在于,所述获取时间序列上的预设窗口长度的传感器数据,包括:The method according to claim 1, wherein the acquiring sensor data of a preset window length on a time series comprises:
    获取所述电子设备与头戴设备的连接状态;Obtain the connection status between the electronic device and the head-mounted device;
    若所述电子设备与所述头戴设备处于连接状态,获取时间序列上的预设窗口长度的传感器数据。If the electronic device is in a connected state with the head mounted device, acquire sensor data of a preset window length on a time series.
  8. 根据权利要求7所述的方法,其特征在于,所述获取所述电子设备与头戴设备的连接状态,包括:The method according to claim 7, wherein the acquiring the connection status between the electronic device and the head-mounted device comprises:
    获取所述电子设备的状态值;obtain the status value of the electronic device;
    若所述电子设备的状态值为第一状态值,确定所述电子设备与所述头戴设备处于连接状态;If the state value of the electronic device is a first state value, determine that the electronic device and the head-mounted device are in a connected state;
    若所述电子设备的状态值为第二状态值,确定所述电子设备与所述头戴设备处于未连接状态。If the state value of the electronic device is the second state value, it is determined that the electronic device and the head mounted device are in a disconnected state.
  9. 根据权利要求8所述的方法,其特征在于,所述方法还包括:The method according to claim 8, wherein the method further comprises:
    通过监听广播的方式,确定所述电子设备与所述头戴设备的连接状态。The connection state between the electronic device and the head-mounted device is determined by monitoring the broadcast.
  10. 根据权利要求7-9任一所述的方法,其特征在于,所述头戴设备为无线蓝牙耳机;所述获取时间序列上的预设窗口长度的传感器数据,包括:The method according to any one of claims 7-9, wherein the head-mounted device is a wireless Bluetooth headset; the acquiring sensor data of a preset window length on a time series comprises:
    检测所述无线蓝牙耳机是否处于佩戴状态;Detecting whether the wireless Bluetooth headset is in a wearing state;
    若所述无线蓝牙耳机处于佩戴状态,获取所述时间序列上的预设窗口长度的传感器数据。If the wireless Bluetooth headset is in a wearing state, acquire sensor data of a preset window length on the time series.
  11. 根据权利要求10所述的方法,其特征在于,所述检测所述无线蓝牙耳机是否处于佩戴状态,包括:The method according to claim 10, wherein the detecting whether the wireless Bluetooth headset is in a wearing state comprises:
    获取所述无线蓝牙耳机内设置的红外传感器返回的状态值;Obtain the status value returned by the infrared sensor set in the wireless Bluetooth headset;
    若所述状态值表征所述红外传感器发射的红外信号被遮挡,确定所述无线蓝牙耳机处于佩戴状态;If the state value indicates that the infrared signal emitted by the infrared sensor is blocked, it is determined that the wireless Bluetooth headset is in a wearing state;
    若所述状态值表征所述红外传感器发射的红外信号未被遮挡,确定所述无线蓝牙耳机处于未佩戴状态。If the state value indicates that the infrared signal emitted by the infrared sensor is not blocked, it is determined that the wireless Bluetooth headset is in an unworn state.
  12. 根据权利要求7-9任一所述的方法,其特征在于,所述头戴设备为无线蓝牙耳机;所述获取时间序列上的预设窗口长度的传感器数据,包括:The method according to any one of claims 7-9, wherein the head-mounted device is a wireless Bluetooth headset; the acquiring sensor data of a preset window length on a time series comprises:
    若所述无线蓝牙耳机处于佩戴状态,检测所述无线蓝牙耳机是否处于开机状态;If the wireless Bluetooth headset is in a wearing state, detect whether the wireless Bluetooth headset is in a powered-on state;
    当确定所述无线蓝牙耳机处于佩戴状态且处于开机状态时,获取时间序列上的预设窗口长度的传感器数据。When it is determined that the wireless Bluetooth headset is in a wearing state and is in a power-on state, sensor data of a preset window length on a time series is acquired.
  13. 根据权利要求12所述的方法,其特征在于,所述检测所述无线蓝牙耳机是否处于开机状态,包括:The method according to claim 12, wherein the detecting whether the wireless Bluetooth headset is turned on comprises:
    检测所述无线蓝牙耳机的多个功能按键或者功能按钮是否处于正常工作状态;Detecting whether multiple function keys or function buttons of the wireless Bluetooth headset are in normal working state;
    若检测到所述无线蓝牙耳机的多个功能按键或者功能按钮处于正常工作状态,确定所述无线蓝牙耳机处于开机状态。If it is detected that multiple function keys or function buttons of the wireless Bluetooth headset are in a normal working state, it is determined that the wireless Bluetooth headset is in a power-on state.
  14. 根据权利要求12所述的方法,其特征在于,所述检测所述无线蓝牙耳机是否处于开机状态,包括:The method according to claim 12, wherein the detecting whether the wireless Bluetooth headset is turned on comprises:
    检测所述无线蓝牙耳机当前是否可以被使用;Detecting whether the wireless Bluetooth headset can currently be used;
    若检测到所述无线蓝牙耳机当前可以被使用,确定所述无线蓝牙耳机处于开机状态。If it is detected that the wireless Bluetooth headset can currently be used, it is determined that the wireless Bluetooth headset is in a powered-on state.
  15. 根据权利要求1-14任一所述的方法,其特征在于,所述动作识别模型包括依次连接的第一卷积层,第二卷积层,最大池化层,第三卷积层,第四卷积层,全局平均池化层,全连层以及softmax层;所述第一卷积层和所述第二卷积层为卷积核为7且维度为64的卷积层;所述第三卷积层和所述第四卷积层为卷积核为7且维度为128的卷积层。The method according to any one of claims 1-14, wherein the action recognition model comprises a first convolutional layer, a second convolutional layer, a maximum pooling layer, a third convolutional layer, a Four convolutional layers, a global average pooling layer, a fully connected layer and a softmax layer; the first convolutional layer and the second convolutional layer are convolutional layers with a convolution kernel of 7 and a dimension of 64; the The third convolutional layer and the fourth convolutional layer are convolutional layers with a convolution kernel of 7 and a dimension of 128.
  16. 根据权利要求1-14任一所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 1-14, wherein the method further comprises:
    基于交叉熵损失函数,对卷积神经网络模型进行迭代训练,直至迭代次数达到最大迭代次数,将达到最大迭代次数时的卷积神经网络模型作为所述动作识别模型。Based on the cross-entropy loss function, the convolutional neural network model is iteratively trained until the number of iterations reaches the maximum number of iterations, and the convolutional neural network model when the maximum number of iterations is reached is used as the action recognition model.
  17. 根据权利要求16所述的方法,其特征在于,所述交叉熵损失函数为The method according to claim 16, wherein the cross entropy loss function is
    Figure PCTCN2021139746-appb-100001
    其中:所述M表征动作类别的数量;所述y ic表征指示变量;所述p ic表征对于观测样本i属于动作类别c的预测概率。
    Figure PCTCN2021139746-appb-100001
    Wherein: the M represents the number of action categories; the y ic represents the indicator variable; the pic represents the predicted probability that the observed sample i belongs to the action category c.
  18. 一种动作识别装置,其特征在于,运行于电子设备,所述装置包括:A motion recognition device, characterized in that it runs on electronic equipment, and the device includes:
    数据获取单元,用于获取时间序列上的预设窗口长度的传感器数据;a data acquisition unit, used for acquiring sensor data of a preset window length on a time series;
    结果输出单元,用于将所述传感器数据输入动作识别模型,获取所述动作识别模型输出的动作分类结果;a result output unit, configured to input the sensor data into an action recognition model, and obtain an action classification result output by the action recognition model;
    操作执行单元,用于若基于所述动作分类结果确定有动作发生时,控制所述电子设备响应于与所述动作对应的操作。The operation execution unit is configured to control the electronic device to respond to an operation corresponding to the action if it is determined that an action occurs based on the action classification result.
  19. 一种电子设备,其特征在于,包括一个或多个处理器以及存储器;一个或多个程序被存储在所述存储器中并被配置为由所述一个或多个处理器执行权利要求1-17任一所述的方法。An electronic device, characterized by comprising one or more processors and a memory; one or more programs are stored in the memory and configured to be executed by the one or more processors of claims 1-17 any of the methods described.
  20. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质中存储有程序代码,其中,在所述程序代码被处理器运行时执行权利要求1-17任一所述的方法。A computer-readable storage medium, characterized in that a program code is stored in the computer-readable storage medium, wherein the method of any one of claims 1-17 is executed when the program code is executed by a processor.
PCT/CN2021/139746 2021-01-28 2021-12-20 Action recognition method and apparatus, and electronic device and storage medium WO2022161026A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110118213.4 2021-01-28
CN202110118213.4A CN112817450A (en) 2021-01-28 2021-01-28 Action recognition method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2022161026A1 true WO2022161026A1 (en) 2022-08-04

Family

ID=75860099

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/139746 WO2022161026A1 (en) 2021-01-28 2021-12-20 Action recognition method and apparatus, and electronic device and storage medium

Country Status (2)

Country Link
CN (2) CN112817450A (en)
WO (1) WO2022161026A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112817450A (en) * 2021-01-28 2021-05-18 Oppo广东移动通信有限公司 Action recognition method and device, electronic equipment and storage medium
CN114020382A (en) * 2021-10-29 2022-02-08 杭州逗酷软件科技有限公司 Execution method, electronic equipment and computer storage medium
CN114067623B (en) * 2021-12-01 2024-03-22 云南民族大学 Non-paper surface contact foreign language learning responder

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110110616A (en) * 2019-04-19 2019-08-09 出门问问信息科技有限公司 A kind of electronic equipment and control method
CN110348494A (en) * 2019-06-27 2019-10-18 中南大学 A kind of human motion recognition method based on binary channels residual error neural network
CN110991482A (en) * 2019-10-31 2020-04-10 深圳市木瓜有子健康科技有限公司 Body-building action recognition method, terminal and computer storage medium
CN111200745A (en) * 2019-12-31 2020-05-26 歌尔股份有限公司 Viewpoint information acquisition method, apparatus, device and computer storage medium
CN112817450A (en) * 2021-01-28 2021-05-18 Oppo广东移动通信有限公司 Action recognition method and device, electronic equipment and storage medium

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018141409A1 (en) * 2017-02-06 2018-08-09 Telefonaktiebolaget Lm Ericsson (Publ) Initiating a control operation in response to a head gesture
CN106919958B (en) * 2017-03-21 2020-06-26 电子科技大学 Human body finger action recognition method based on smart watch
CN108062170A (en) * 2017-12-15 2018-05-22 南京师范大学 Multi-class human posture recognition method based on convolutional neural networks and intelligent terminal
KR102057705B1 (en) * 2018-01-31 2019-12-19 계명대학교 산학협력단 A smart hand device for gesture recognition and control method thereof
CN109508677B (en) * 2018-11-15 2021-07-13 电子科技大学 Improved CNN network-based recognition method for handwriting actions in air
CN109731302A (en) * 2019-01-22 2019-05-10 深圳职业技术学院 Athletic posture recognition methods, device and electronic equipment

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110110616A (en) * 2019-04-19 2019-08-09 出门问问信息科技有限公司 A kind of electronic equipment and control method
CN110348494A (en) * 2019-06-27 2019-10-18 中南大学 A kind of human motion recognition method based on binary channels residual error neural network
CN110991482A (en) * 2019-10-31 2020-04-10 深圳市木瓜有子健康科技有限公司 Body-building action recognition method, terminal and computer storage medium
CN111200745A (en) * 2019-12-31 2020-05-26 歌尔股份有限公司 Viewpoint information acquisition method, apparatus, device and computer storage medium
CN112817450A (en) * 2021-01-28 2021-05-18 Oppo广东移动通信有限公司 Action recognition method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN112817450A (en) 2021-05-18
CN114167984A (en) 2022-03-11
CN114167984B (en) 2024-03-12

Similar Documents

Publication Publication Date Title
WO2022161026A1 (en) Action recognition method and apparatus, and electronic device and storage medium
US10169639B2 (en) Method for fingerprint template update and terminal device
CN107360327A (en) Audio recognition method, device and storage medium
WO2018113512A1 (en) Image processing method and related device
WO2018133874A1 (en) Method and device for sending warning message
CN108427873B (en) Biological feature identification method and mobile terminal
CN109947650B (en) Script step processing method, device and system
CN109561420B (en) Emergency help-seeking method and related equipment
CN110765924B (en) Living body detection method, living body detection device and computer readable storage medium
CN107885448B (en) Control method for application touch operation, mobile terminal and readable storage medium
WO2017088434A1 (en) Human face model matrix training method and apparatus, and storage medium
CN109256151B (en) Call voice regulation and control method and device, mobile terminal and readable storage medium
CN117130469B (en) Space gesture recognition method, electronic equipment and chip system
CN108765522B (en) Dynamic image generation method and mobile terminal
CN115187824A (en) Model training method, scene recognition method and related equipment
CN112329926A (en) Quality improvement method and system for intelligent robot
CN111046742B (en) Eye behavior detection method, device and storage medium
CN107844759A (en) A kind of gesture identification method, terminal and storage medium
CN107944242B (en) Biological identification function disabling method and mobile terminal
CN109885224A (en) Interface of input method display methods, storage medium and mobile terminal
CN111027406B (en) Picture identification method and device, storage medium and electronic equipment
CN115240250A (en) Model training method and device, computer equipment and readable storage medium
CN110852217B (en) Face recognition method and electronic equipment
CN110084016B (en) Face recognition unlocking method and device, mobile terminal and storage medium
CN108958505B (en) Method and terminal for displaying candidate information

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21922616

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21922616

Country of ref document: EP

Kind code of ref document: A1