CN110955327B

CN110955327B - Method for starting and closing intelligent equipment, storage device and terminal

Info

Publication number: CN110955327B
Application number: CN201811133114.8A
Authority: CN
Inventors: 王燕飞
Original assignee: Qiku Internet Network Scientific Shenzhen Co ltd
Current assignee: Qiku Internet Network Scientific Shenzhen Co ltd
Priority date: 2018-09-27
Filing date: 2018-09-27
Publication date: 2023-06-09
Anticipated expiration: 2038-09-27
Also published as: CN110955327A

Abstract

The invention discloses a method, a storage device and a terminal for starting and closing intelligent equipment, which are used for comparing recognized voice information and/or gesture information with model information by acquiring the information, and starting the equipment if the similarity reaches a set value. According to the intelligent device and the intelligent control method, the intelligent device is started quickly through the combination of the voice information and the gestures, and the intelligent device can still be started and closed normally even when the voice function is affected by external noise or inconvenient for a user.

Description

Method for starting and closing intelligent equipment, storage device and terminal

Technical Field

The present invention relates to the field of terminal technologies, and in particular, to a method for starting and closing an intelligent device, a storage device, and a terminal.

Background

Along with the development of computer technology, people's life has gradually moved into the intelligent era nowadays, and relevant products such as smart phones, smart televisions, smart speakers and the like will provide convenient and fast services in various aspects of people's life. Nevertheless, at present, human-computer interaction still does not reach a mature level, for example, the accuracy of voice recognition when starting certain intelligent home has a better voice recognition result in an environment with a near sound source and high quality of voice signals; however, when the specific position of the sound source is far away and the noise, reverberation and reflection exist in the environment, the recognition rate is still low. Therefore, how to provide a method for accurately starting the intelligent device is a problem to be solved.

Disclosure of Invention

The invention mainly aims to provide a method for starting and closing intelligent equipment, a storage device and a terminal, and aims to solve the problems.

In order to achieve the above object, the present invention provides a method for starting and closing an intelligent device, the method comprising the steps of:

acquiring voice information and/or gesture information of a user;

comparing the voice information and/or gesture information with a corresponding voice template or gesture template;

and starting if the similarity between the voice information and the voice template reaches a first threshold value and/or the similarity between the gesture information and the gesture template reaches a second threshold value.

Further, before the step of acquiring the voice information and/or gesture information of the user, the method includes:

when a user is sensed through the human body induction switch, the device enters a standby state;

further, after the step of entering the standby state when the human body induction switch senses the user, the method comprises the following steps:

if the time which is not sensed by the user in the sensing range reaches the first preset time, the standby state is automatically closed.

Further, if the similarity between the voice information and the voice template reaches a first threshold and/or the similarity between the gesture information and the gesture template reaches a second threshold, the step of starting includes:

and if the time which is not sensed by the user in the sensing range reaches the second preset time, closing the equipment.

Further, the number of the voice templates is multiple, each voice template is a voice information base corresponding to one appointed user, and the voice templates are provided with user voice information with preset number; the voice information base comprises: the length of the voice, the volume of the voice, the specific semantics, the speed of the voice and the voice frequency.

Further, after the step of comparing the voice information and/or gesture information with the corresponding voice template or gesture template, the method includes:

if the similarity between the voice information and the information in the preset voice template is lower than a first threshold, the voice information is not responded, and the latest received voice information is reserved.

Further, if the similarity between the voice information and the information in the preset voice template is lower than the first threshold, the step of not responding and retaining the latest received voice information comprises the following steps:

if the signal for manually starting the intelligent equipment is received within the preset time, comparing the reserved latest received voice information with the voice information in the voice template, finding out the voice information with the lowest similarity and replacing the voice information.

if a signal for manually starting the equipment is received within the preset time, prompting whether to update the voice template;

if a command signal for updating the voice template is received, selecting the voice template closest to the received voice information; receiving a piece of new voice information of a user, and prompting whether the new voice information is added to a voice template;

if a confirmation signal for adding the new voice information to the voice template is received, replacing one voice information with the lowest similarity rate in the voice template with the new voice information.

The present application also proposes a device having a memory function, on which a computer program is stored, which, when being executed by a processor, implements the steps of the method of any of the above.

The present application also proposes a terminal comprising a processor and a memory, said memory having stored therein a computer program which, when executed by the processor, implements the steps of the method of any of the preceding claims.

According to the method, the storage device and the terminal for starting and closing the intelligent equipment, the identified voice information and/or gesture information are obtained, the information is compared with the model information, and if the similarity rate reaches a set value, the equipment is started. According to the intelligent device and the intelligent control method, the intelligent device is started quickly through the combination of the voice information and the gestures, and the intelligent device can still be started and closed normally even when the voice function is affected by external noise or inconvenient for a user.

Drawings

FIG. 1 is a flow chart of a method for starting and shutting down a smart device according to an embodiment of the present application;

FIG. 2 is a flowchart of manually updating voice information in a voice template according to an embodiment of the present application;

fig. 3 is a schematic block diagram of a terminal according to an embodiment of the present application.

The achievement of the objects, functional features and advantages of the present invention will be further described with reference to the accompanying drawings, in conjunction with the embodiments.

Detailed Description

It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention.

Embodiments of the present invention are described in detail below, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to like or similar elements or elements having like or similar functions throughout. The embodiments described below by referring to the drawings are illustrative only and are not to be construed as limiting the invention.

As used herein, the singular forms "a", "an", "the" and "the" are intended to include the plural forms as well, unless expressly stated otherwise, as understood by those skilled in the art. It will be further understood that the terms "comprises" and/or "comprising," when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. It will be understood that when an element is referred to as being "connected" or "coupled" to another element, it can be directly connected or coupled to the other element or intervening elements may also be present. Further, "connected" or "coupled" as used herein may include wirelessly connected or wirelessly coupled. The term "and/or" as used herein includes all or any element and all combination of one or more of the associated listed items.

It will be understood by those skilled in the art that all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs unless defined otherwise. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the prior art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.

As used herein, a "terminal" includes both a device of a wireless signal receiver having no transmitting capability and a device of receiving and transmitting hardware having receiving and transmitting hardware capable of performing bi-directional communications over a bi-directional communication link, as will be appreciated by those skilled in the art. Such a device may include: a cellular or other communication device having a single-line display or a multi-line display or a cellular or other communication device without a multi-line display; a PCS (Personal Communications Service, personal communication system) that may combine voice, data processing, facsimile and/or data communication capabilities; a PDA (Personal Digital Assistant ) that can include a radio frequency receiver, pager, internet/intranet access, web browser, notepad, calendar and/or GPS (Global Positioning System ) receiver; a conventional laptop and/or palmtop computer or other appliance that has and/or includes a radio frequency receiver. As used herein, "terminal," "terminal device" may be portable, transportable, installed in a vehicle (aeronautical, maritime, and/or land-based), or adapted and/or configured to operate locally and/or in a distributed fashion, to operate at any other location(s) on earth and/or in space. The "terminal" and "terminal device" used herein may also be a communication terminal, a network access terminal, and a music/video playing terminal, for example, may be a PDA, a MID (Mobile Internet Device ), and/or a mobile phone with a music/video playing function, and may also be a smart tv, a set top box, and other devices.

Referring to fig. 1, an embodiment of the present application proposes a method for starting up and shutting down a smart device, the method including the steps of:

s1, acquiring voice information and/or gesture information of a user;

s2, comparing the voice information and/or gesture information with a corresponding voice template or gesture template;

s3, if the similarity between the voice information and the voice template reaches a first threshold value and/or the similarity between the gesture information and the gesture template reaches a second threshold value, starting.

As described in the above step S1, the voice information refers to a specific voice uttered or uttered by the user, and the gesture information refers to a gesture or motion of the user captured by a sensor (including a camera and a distance sensor) on the device, including capturing height information. The height information is used for quickly matching the gesture templates. For example, three people, namely parents and children, exist in a family user, the difference of heights of the people can be large, the detected gesture information can be accurately compared with the corresponding gesture template by directly capturing the height information and rapidly matching the gesture template and then detecting the gesture information, and therefore judgment can be efficiently and accurately made.

The voice template refers to voice information recorded by the user and stored in the device, as described in the step S2. The gesture template refers to a photo or a short video recorded by a user, and is used for comparing whether the action of the user is consistent with the recorded action; the gesture templates may be actions of a part of the user's body or may be actions of the whole body. Wherein the photo templates are used to compare stationary actions and the short video templates are used to compare continuous actions.

As described in the step S3, the first threshold and the second threshold are preset, and the accuracy is different due to different sensors; and the input information of the same user is not 100% identical with the corresponding template, so that the two points are combined, and a proper threshold value is set as a correct reference. For example, in the acquired voice information, the voice information is compared with the voice information in the voice template, and if the similarity rate is more than 70%, the acquired voice information is judged to be consistent with the template. At this time, the device is started.

In an embodiment, before the step of obtaining the voice information and/or gesture information of the user, the method includes:

in this embodiment, the human body sensing switch is a special switch capable of detecting whether a person exists in a detection area. For example, a pyroelectric human body sensing switch emits infrared rays with a wavelength of 9.5 μm due to the fact that the human body temperature is constant at about 37 ℃, and the pyroelectric human body sensing switch is a sensor which is dependent on a specific type to detect the infrared rays emitted by a human body. The infrared rays with the diameter of 9.5 mu m emitted by a human body are enhanced and gathered on an infrared induction source of the human body induction switch through a Fresnel lens, the infrared induction source usually adopts a pyroelectric element, the element loses charge balance when receiving the change of the temperature of the infrared radiation of the human body, charges are released outwards, and a follow-up circuit can trigger the switch to act after detection processing. The human body sensing switch is continuously turned on without leaving the sensing range. In this embodiment, when the user is sensed to be nearby (i.e. in the sensing area of the human body sensing switch), the device is awakened to enter a standby state, and the switch is continuously turned on as long as the user is in the sensing area, so that the device is kept in the standby state. The intelligent human body induction switch is convenient for users to use, fully embodies the intelligence of the artificial intelligent device, is a switch with extremely low energy consumption, is usually not more than 0.1w, and has energy consumption lower by one to two orders of magnitude than that of normal opening when the household equipment is in a standby state.

In an embodiment, after the step of waking up the device to enter the standby state when the human body induction switch senses the user, the method includes:

In this embodiment, the function of the body sensing switch is to sense whether the user is within the detection range of the device, if the body sensing switch senses that the user is not within the detection range, the standby state of the device will not be significant at this time, so when the time when the body sensing switch senses that the user is not within the detection range reaches the first preset time, the device is turned off. The processing mode embodies the intelligence of the equipment and saves energy.

In an embodiment, if the similarity between the voice information and the voice template reaches a first threshold and/or the similarity between the gesture information and the gesture template reaches a second threshold, the step of starting includes:

In this embodiment, the human body induction switch still continuously detects whether the user is in the detection range during the normal operation of the device opening, if the user is in the detection range, the user is indicated to open the device and use the device normally, if the user is not in the detection range, the device is normally opened, and if the user is not in the detection range, the user is indicated to leave temporarily or forget to close the device; therefore, in the normal working process after the equipment is started, if no user is detected in the detection range, recording the time node; after reaching the second preset time, if the user is still not detected in the detection range, the fact that the time of leaving the user reaches the second preset time is indicated, and the equipment is closed at the moment, so that the method is a reasonable energy saving mode. Specifically, for example, when the user returns to home, the smart television is started, a telephone is received during watching television, and when the user forgets to turn off the television during going out, the smart television makes a time record mark when the user goes out, and after the time length reaches the set 15min, the user still does not return home, and at the moment, the smart television is automatically turned off; and in practice the user leaves home 4 hours later. In the embodiment, the intelligent closing process of the intelligent equipment is fully embodied.

In an embodiment, the number of the voice templates is multiple, each voice template is a voice information base corresponding to one appointed user, and a preset number of user voice information is arranged in the voice templates; the voice information base comprises: the length of the voice, the volume of the voice, the specific semantics, the speed of the voice and the voice frequency.

In this embodiment, the comparison weight can be set for the length of the voice, the volume of the voice, the specific semantics, the speed of the voice and the voice frequency, that is, the recognition of the similarity can be correspondingly adjusted. For example, weight adjustments to a particular semantic or speech frequency, i.e., toward semantic activation or toward voice activation by the user. Specifically, a voice template of the user for the third party is stored in an intelligent device, when the length of the voice, the volume of the voice, specific semantics and the weight ratio of the speed of the voice are respectively set to be 30%,10%,10% and 50%, when the voice is used for the third party to deactivate the device, the length of the voice, the volume of the voice, specific semantics and the speed of the voice in voice information sent by the third party are respectively compared with the voice template, the compared results are respectively multiplied by 30%,10%,10% and 50%, and then the comparison results are added, and when the added results are larger than the set threshold, the device is started. The advantage of this arrangement over semantic recognition alone in general is a wider population of users and higher accuracy for specific users. For example, semantics can generally only be set for certain languages that are widely used, such as chinese, english, etc. For users with nonstandard speaking, such as speaking with dialects, the semantic recognition accuracy will be greatly compromised; for stuttering users, the intelligence of the intelligent device is difficult to embody only through semantic recognition. For the special user, the speaking characteristics are mainly reflected in the speed of the voice and the length of the voice, and in the voice recognition, the weight of the two items is improved, so that the intelligent equipment can be started intelligently for the special user.

In an embodiment, after the step of comparing the voice information and/or gesture information with the corresponding voice template or gesture template, the method includes:

In this embodiment, when the received voice information is not matched with the model in the voice template, the latest received voice information is retained, and in order to eliminate noise interference, a lower limit of the similarity ratio may be set. For example, a similarity ratio below 90% will not activate the device, a similarity ratio between 60% and 90% will be preserved by the device, a similarity ratio below 60% will not be preserved, and only the latest piece of input voice information is preserved when it is preserved.

In an embodiment, if the similarity between the voice information and the information in the preset voice template is lower than the first threshold, the step of not responding and retaining the latest received voice information includes:

In this embodiment, it is explained that when the user issues a voice information command and cannot start the device, after the user manually starts the device, the latest received voice information is reserved in the matched user information base to replace a voice template with the lowest similarity to the voice in the voice information base.

Referring to fig. 2, in an embodiment, if the similarity ratio between the voice information and the information in the preset voice template is lower than the first threshold, the step of not responding and retaining the latest received voice information includes:

s10, if a signal for manually starting the equipment is received within a preset time, prompting whether to update the voice template;

s20, if a command signal for updating the voice template is received, selecting the voice template closest to the received voice information; receiving a piece of new voice information of a user, and prompting whether the new voice information is added to a voice template;

and S30, if a confirmation signal for adding the new voice information to the voice template is received, replacing one voice information with the lowest similarity rate in the voice template with the new voice information.

If the voice message is received first, as described in step S10 above, the device is not started because the voice message matching degree does not reach the threshold. And a signal for manually starting the device is received within a preset time, which indicates that the voice information in the voice library is likely to be problematic. The preset time is to ensure that when the voice information command cannot start the device in this embodiment, the user immediately makes an operation of starting the device, and then prompts whether to update the voice template. Thereby ensuring that the prompt at the moment meets the requirements of users.

As described in the above step S20, when a command signal for updating the voice template is received, the process of selecting the voice template closest to the received voice information is: and comparing the received voice information with voice information in all voice templates, and finding out a voice information with the highest similarity after all comparison is completed, wherein the selected voice template is the template in which the voice information with the highest similarity is located. And after the template is selected, receiving new voice information recorded by the user, and prompting the user to add the new voice information into the voice template.

As described in the above step S30, in combination with step S20, the device achieves the purpose of updating the voice template by re-entering the voice information of the user, that is, identifying the new voice information, and replacing the old voice information.

In an embodiment, the manual updating method of gesture information in the present invention is the same as the above-mentioned manual updating method of voice information, and will not be described herein.

The embodiment of the application also provides a device with a storage function, and a computer program is stored on the device, and is characterized in that the computer program is executed by a processor to execute the steps of any one of the methods.

Referring to fig. 3, an embodiment of the present invention further provides a terminal, including a processor 1080 and a memory 1020, wherein the memory 1020 stores a computer program, which when executed by the processor 1080, implements the steps of the method for starting up and shutting down the smart device.

For convenience of explanation, only those portions of the embodiments of the present invention that are relevant to the embodiments of the present invention are shown, and specific technical details are not disclosed, please refer to the method portions of the embodiments of the present invention. The terminal may be any terminal device including a smart PC, a tablet PC, a PDA (Personal Digital Assistant ), a POS (Point of Sales), a vehicle-mounted computer, etc., taking the terminal as an example of the smart PC:

fig. 3 is a block diagram showing a part of the structure of a smart PC related to a terminal provided by an embodiment of the present invention. Referring to fig. 3, the smart PC includes: radio Frequency (RF) circuitry 1010, memory 1020, input unit 1030, display unit 1040, sensor 1050, audio circuitry 1060, wireless fidelity (wireless fidelity, wiFi) module 1070, processor 1080, and power source 1090. Those skilled in the art will appreciate that the smart PC architecture shown in fig. 3 is not limiting of smart PCs and may include more or fewer components than shown, or may combine certain components, or may be arranged in different components.

The following describes the respective constituent elements of the smart PC in detail with reference to fig. 3:

the RF circuit 1010 may be used for receiving and transmitting signals during a message or a call, and particularly, after receiving downlink information of a base station, the signal is processed by the processor 1080; in addition, the data of the design uplink is sent to the base station. Typically, the RF circuitry 1010 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low noise amplifier (Low Noise Amplifier, LNA), a duplexer, and the like. In addition, the RF circuitry 1010 may also communicate with networks and other devices via wireless communications. The wireless communications may use any communication standard or protocol including, but not limited to, global system for mobile communications (Global System of Mobile communication, GSM), general packet radio service (General Packet Radio Service, GPRS), code division multiple access (Code Division Multiple Access, CDMA), wideband code division multiple access (Wideband Code Division Multiple Access, WCDMA), long term evolution (Long Term Evolution, LTE), email, short message service (Short Messaging Service, SMS), and the like.

The memory 1020 may be used to store software programs and modules, and the processor 1080 may perform various functional applications and data processing of the smart PC by executing the software programs and modules stored in the memory 1020. The memory 1020 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program (such as a sound playing function, an image playing function, etc.) required for at least one function, and the like; the storage data area may store data (such as audio data, phonebook, etc.) created according to the use of the smart PC, etc. In addition, memory 1020 may include high-speed random access memory and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state memory device.

The input unit 1030 may be used to receive input numeric or character information and generate key signal inputs related to user settings and function control of the smart PC. In particular, the input unit 1030 may include a touch panel 1031 and other input devices 1032. The touch panel 1031, also referred to as a touch screen, may collect touch operations thereon or thereabout by a user (e.g., operations of the user on the touch panel 1031 or thereabout using any suitable object or accessory such as a finger, stylus, etc.), and drive the corresponding connection device according to a predetermined program. Alternatively, the touch panel 1031 may include two parts, a touch detection device and a touch controller. The touch detection device detects the touch azimuth of a user, detects a signal brought by touch operation and transmits the signal to the touch controller; the touch controller receives touch information from the touch detection device and converts it into touch point coordinates, which are then sent to the processor 1080 and can receive commands from the processor 1080 and execute them. Further, the touch panel 1031 may be implemented in various types such as resistive, capacitive, infrared, and surface acoustic wave. The input unit 1030 may include other input devices 1032 in addition to the touch panel 1031. In particular, other input devices 1032 may include, but are not limited to, one or more of a physical keyboard, function keys (e.g., volume control keys, switch keys, etc.), a track ball, a mouse, a joystick, etc.

The display unit 1040 may be used to display information input by a user or information provided to the user and various menus of the smart PC. The display unit 1040 may include a display panel 1041, and alternatively, the display panel 1041 may be configured in the form of a liquid crystal display (Liquid Crystal Display, LCD), an Organic Light-Emitting Diode (OLED), or the like. Further, the touch panel 1031 may overlay the display panel 1041, and when the touch panel 1031 detects a touch operation thereon or thereabout, the touch panel is transferred to the processor 1080 to determine a type of touch event, and then the processor 1080 provides a corresponding visual output on the display panel 1041 according to the type of touch event. Although in fig. 3, the touch panel 1031 and the display panel 1041 are two separate components to implement the input and input functions of the smart PC, in some embodiments, the touch panel 1031 may be integrated with the display panel 1041 to implement the input and output functions of the smart PC.

The smart PC may also include at least one sensor 1050, such as a light sensor, a motion sensor, and other sensors. Specifically, the light sensor may include an ambient light sensor that may adjust the brightness of the display panel 1041 according to the brightness of ambient light, and a proximity sensor that may turn off the display panel 1041 and/or the backlight when the smart PC moves to the ear. As one of the motion sensors, the accelerometer sensor can detect the acceleration in all directions (generally three axes), and can detect the gravity and direction when stationary, and can be used for recognizing intelligent PC gestures (such as horizontal and vertical screen switching, related games, magnetometer gesture calibration), vibration recognition related functions (such as pedometer, knocking) and the like; other sensors such as gyroscopes, barometers, hygrometers, thermometers, infrared sensors, etc. that may also be configured by the smart PC are not described in detail herein.

Audio circuitry 1060, a speaker 1061, and a microphone 1062 may provide an audio interface between the user and the smart PC. Audio circuit 1060 may transmit the received electrical signal after audio data conversion to speaker 1061 for conversion by speaker 1061 into an audio signal output; on the other hand, microphone 1062 converts the collected sound signals into electrical signals, which are received by audio circuit 1060 and converted into audio data, which are processed by audio data output processor 1080 for transmission to, for example, another smart PC via RF circuit 1010 or for output to memory 1020 for further processing.

WiFi belongs to a short-distance wireless transmission technology, and the intelligent PC can help a user to send and receive e-mails, browse web pages, access streaming media and the like through the WiFi module 1070, so that wireless broadband Internet access is provided for the user. Although fig. 3 shows a WiFi module 1070, it is understood that it does not belong to the essential constitution of a smart PC, and can be omitted entirely as required within the scope of not changing the essence of the invention.

Processor 1080 is the control center of the smart PC, connects the various parts of the overall smart PC using various interfaces and lines, and performs various functions and processes data of the smart PC by running or executing software programs and/or modules stored in memory 1020 and invoking data stored in memory 1020, thereby performing overall monitoring of the smart PC. Optionally, processor 1080 may include one or more processing units; preferably, processor 1080 may integrate an application processor primarily handling operating systems, user interfaces, applications, etc., with a modem processor primarily handling wireless communications. It will be appreciated that the modem processor described above may not be integrated into processor 1080.

The smart PC also includes a power source 1090 (e.g., a battery) for powering the various components, which may preferably be logically coupled to the processor 1080 via a power management system, such as to provide charge, discharge, and power management functions via the power management system.

Although not shown, the smart PC may further include a camera, a bluetooth module, etc., which will not be described herein.

Referring to fig. 3, in an embodiment of the present invention, the processor 1080 included in the terminal further has the following functions:

acquiring voice information and/or gesture information of a user;

It will be clear to those skilled in the art that, for convenience and brevity of description, specific working procedures of the above-described systems, apparatuses and units may refer to corresponding procedures in the foregoing method embodiments, which are not repeated herein.

In the several embodiments provided in this application, it should be understood that the disclosed systems, apparatuses, and methods may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative, e.g., the division of the units is merely a logical function division, and there may be additional divisions when actually implemented, e.g., multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed with each other may be an indirect coupling or communication connection via some interfaces, devices or units, which may be in electrical, mechanical or other form.

In addition, each functional unit in the embodiments of the present invention may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit. The integrated units may be implemented in hardware or in software functional units.

Those of ordinary skill in the art will appreciate that all or a portion of the steps in implementing the methods of the above embodiments may be implemented by a program to instruct related hardware, where the program may be stored in a computer readable storage medium, where the storage medium may be a read only memory, a magnetic disk or optical disk, etc.

The foregoing description is only of the preferred embodiments of the present invention and is not intended to limit the scope of the invention, and all equivalent structures or equivalent processes using the descriptions and drawings of the present invention or directly or indirectly applied to other related technical fields are included in the scope of the invention.

Claims

1. A method of powering on and off a smart device, the method comprising the steps of:

acquiring voice information and/or gesture information of a user;

if the similarity between the voice information and the information in the preset voice template is lower than a first threshold, not responding and reserving the latest received voice information;

if a confirmation signal for adding the new voice information to the voice template is received, replacing one voice information with the lowest similarity rate in the voice template with the new voice information;

2. The method for starting and stopping a smart device according to claim 1, wherein before the step of obtaining voice information and/or gesture information of the user, the method comprises:

when a user is sensed through the human body induction switch, the standby state is entered.

3. The method for starting and stopping the intelligent device according to claim 2, wherein after the step of entering the standby state when the human body sensing switch senses the user, the method comprises:

4. The method for starting and closing a smart device according to claim 1, wherein if the similarity between the voice information and the voice template reaches a first threshold and/or the similarity between the gesture information and the gesture template reaches a second threshold, the step of starting comprises:

5. The method for starting and closing an intelligent device according to claim 1, wherein the number of the voice templates is plural, each voice template is a voice information base corresponding to a specific user, and a preset number of user voice information are set in the voice templates; the voice information base comprises: the length of the voice, the volume of the voice, the specific semantics, the speed of the voice and the voice frequency.

6. The method for starting up and shutting down a smart device according to claim 1, wherein if the similarity between the voice information and the information in the preset voice template is lower than a first threshold, the method does not respond, and after the step of retaining the latest received voice information, the method comprises:

7. An apparatus having a storage function, on which a computer program is stored, characterized in that the computer program, when being executed by a processor, implements the steps of the method according to any one of claims 1 to 6.

8. A terminal comprising a processor and a memory, characterized in that the memory has stored therein a computer program which, when executed by the processor, implements the steps of the method of any of claims 1 to 6.