CN112992130A

CN112992130A - Voice monitoring method and device and mobile intelligent device

Info

Publication number: CN112992130A
Application number: CN201911282304.0A
Authority: CN
Inventors: 乔薇
Original assignee: Beijing Anyun Century Technology Co Ltd
Current assignee: Beijing Anyun Century Technology Co Ltd
Priority date: 2019-12-13
Filing date: 2019-12-13
Publication date: 2021-06-18

Abstract

The invention discloses a voice monitoring method, a voice monitoring device and mobile intelligent equipment, wherein the voice monitoring method comprises the following steps: receiving a voice signal sent by a user; detecting whether the voice signal contains preset keywords or not, wherein the preset keywords are used for representing negative emotions; and if the voice signal contains the preset keyword, controlling the display device to display preset reminding information. The voice monitoring method, the voice monitoring device and the mobile intelligent equipment can enable the user to timely perceive that the user or people around the user generate negative emotion, are beneficial to improving the mental health of the user, effectively enhance the effect of the user in the aspect of improving the use of negative languages, and realize the effect of helping to solve the mental health crisis in an artificial intelligence mode.

Description

Voice monitoring method and device and mobile intelligent device

Technical Field

The invention relates to the technical field of intelligent equipment, in particular to a voice monitoring method and device and mobile intelligent equipment.

Background

With the rapid development of economic society, the pace of life is accelerated, competition is increased, various external pressures are increased, and people inevitably generate negative emotions such as distraction, fasciation, anger, pessimism, disappointment and the like. If people are held for a long time for the negative emotion to grow, the people will be out of control of emotion, and the psychological health of people is affected. Mobile intelligent devices such as mobile phones become devices which are accompanied by users all the time, and in the prior art, the mobile intelligent devices cannot assist users in adjusting emotions, so that intelligent experience is poor.

Disclosure of Invention

The invention aims to provide a voice monitoring method and device and mobile intelligent equipment, and the voice monitoring method and device can remind a user in an intelligent manner when the user generates negative emotion.

The invention is realized by the following technical scheme:

a voice monitoring method is applied to mobile intelligent equipment and comprises the following steps:

receiving a voice signal sent by a user;

detecting whether the voice signal contains preset keywords or not, wherein the preset keywords are used for representing negative emotions;

and if the voice signal contains the preset keyword, controlling the display device to display preset reminding information.

Optionally, the receiving a voice signal sent by a user includes:

the voice signal is received within a preset time period.

Optionally, before the receiving the voice signal sent by the user, the method further includes:

receiving time period configuration information, wherein the time period configuration information comprises the preset time period.

Optionally, before the detecting whether the voice signal includes a preset keyword, the method further includes:

and denoising the voice signal.

judging whether the voice signal is sent out by a specific user or not;

and if the voice signal is sent by the specific user, executing the step of detecting whether the voice signal contains a preset keyword.

and receiving keyword configuration information, wherein the keyword configuration information comprises the preset keywords.

Optionally, after detecting whether the voice signal contains a preset keyword, the method further includes:

and if the voice signal contains the preset keyword, controlling the mobile intelligent equipment to generate a reminding action.

and receiving action configuration information, wherein the action configuration information comprises the reminding action.

Optionally, the reminding action includes at least one of generating a voice prompt, generating a light flash or generating a vibration.

and if the voice signal contains the preset keyword, receiving a voice instruction sent by the user, and executing the voice instruction.

Optionally, before the controlling the display device to display the preset reminding information, the method further includes:

if the voice signal contains the preset keyword, judging whether the preset keyword is detected for the Nth time within preset time length or not, wherein N is an integer not less than 2;

and if the preset keyword is detected for the Nth time within the preset time length, the step of controlling the display device to display preset reminding information is executed.

Based on the same inventive concept, the invention also provides a voice monitoring device, which is applied to mobile intelligent equipment and comprises:

the first receiving module is used for receiving a voice signal sent by a user;

the detection module is used for detecting whether the voice signal contains a preset keyword or not, wherein the preset keyword is used for representing negative emotion;

and the control module is used for controlling the display device to display preset reminding information when the voice signal contains the preset keywords.

Optionally, the first receiving module is configured to receive the voice signal within a preset time period.

Optionally, the voice monitoring apparatus further includes:

and the second receiving module is used for receiving time period configuration information, wherein the time period configuration information comprises the preset time period.

Optionally, the voice monitoring apparatus further includes:

and the denoising processing module is used for denoising the voice signal.

Optionally, the voice monitoring apparatus further includes a second determining module;

the second judging module is used for judging whether the voice signal is sent out by a specific user;

the detection module is used for detecting whether the voice signal contains preset keywords when the voice signal is sent by the specific user.

Optionally, the voice monitoring apparatus further includes:

and the third receiving module is used for receiving keyword configuration information, wherein the keyword configuration information comprises the preset keywords.

Optionally, the control module is further configured to control the mobile smart device to generate a reminding action.

Optionally, the voice monitoring apparatus further includes:

and the fourth receiving module is used for receiving action configuration information, wherein the action configuration information comprises the reminding action.

Optionally, the voice monitoring apparatus further includes:

a fifth receiving module, configured to receive a voice instruction sent by the user;

and the instruction execution module is used for executing the voice instruction.

Optionally, the voice monitoring apparatus further includes a third determining module;

the third judging module is used for judging whether the preset keyword is detected for the Nth time within the preset time length or not, wherein N is an integer not less than 2;

the control module is used for controlling the display device to display preset reminding information when the preset keyword is detected for the Nth time within the preset time.

Based on the same inventive concept, the invention further provides a mobile intelligent device, which comprises a display device, a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the processor implements the steps of the voice monitoring method when executing the computer program.

Based on the same inventive concept, the present invention also provides a computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, implements the steps of the above-mentioned voice listening method.

Compared with the prior art, the invention has the following advantages and beneficial effects:

according to the voice monitoring method and device and the mobile intelligent device, the voice signal sent by the user is detected, and the display device of the mobile intelligent device is controlled to display the preset reminding information when the voice signal is detected to contain the preset keyword representing the negative emotion, so that the user can timely perceive that the user or people nearby generate the negative emotion, the psychological health of the user is improved, and the effect of the user in the aspect of improving the use of the negative language is effectively enhanced. The voice monitoring method and device and the intelligent equipment provided by the invention realize the effect of helping to solve the psychological health crisis by adopting an artificial intelligence mode.

Drawings

The accompanying drawings, which are included to provide a further understanding of the embodiments of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the principles of the invention. In the drawings:

FIG. 1 is a flowchart of a voice monitoring method according to an embodiment of the present invention;

fig. 2 is a schematic structural diagram of a voice monitoring apparatus according to an embodiment of the present invention;

fig. 3 is a schematic structural diagram of a mobile intelligent device according to an embodiment of the present invention.

Detailed Description

The invention provides a voice monitoring method and device and a mobile intelligent device, wherein a voice signal sent by a user is detected, and a display device of the mobile intelligent device is controlled to display preset reminding information when the voice signal is detected to contain preset keywords representing negative emotion, so that the user can timely perceive that the user or people around the user generate negative emotion, and the method and device are beneficial to improving the psychological health of the user.

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail below with reference to examples and accompanying drawings, and the exemplary embodiments and descriptions thereof are only used for explaining the present invention and are not meant to limit the present invention.

Example 1

The embodiment provides a voice monitoring method, which is applied to a mobile intelligent device provided with a display device and used for monitoring keywords for indicating that a user generates negative emotion. The mobile intelligent device can be an intelligent device which can be carried about, such as an intelligent mobile phone, an intelligent bracelet, an intelligent ring or an intelligent watch, and the embodiment of the invention does not limit the intelligent device. Fig. 1 is a flowchart of the voice monitoring method, where the voice monitoring method includes:

step S11, receiving a voice signal sent by a user;

step S12, detecting whether the voice signal contains a preset keyword, wherein the preset keyword is used for representing negative emotion;

if the voice signal includes the preset keyword, step S13 is executed to control the display device to display preset reminding information.

Specifically, a voice acquisition device, such as a microphone or the like, may be used to receive the voice signal. In an optional implementation manner, the voice collecting apparatus may be always in a working state, and monitor the voice signal uninterruptedly. In another alternative implementation, considering that the probability of the user making a sound in some special time periods is very small, for example, the user is in a sleep state in a late-night time period, the monitoring of the voice signal may be stopped, and thus the receiving of the voice signal made by the user may include: the voice signal is received within a preset time period. The preset time period can be set according to actual requirements, and this embodiment does not limit this. For example, if the monitoring of the voice signal is stopped in the sleep time period of the user, the preset time period may be set as a non-sleep time period, for example, 7:00 a day in the morning to 10:00 a day in the evening. The preset time period can be preset by a developer, and can also be set by a user according to the requirement of the user. When the preset time period is set by the user, time period configuration information including the preset time period may be input by the user through an input device such as a key, a touch screen, or a voice capture device before step S11 is performed. And receiving the voice signal in the preset time period by receiving the time period configuration information. By receiving the voice signal in the preset time period, the voice acquisition device can be in a closed state in the time period when the probability that the user makes a sound is very small, so that the power consumption of the intelligent device can be reduced.

After receiving the voice signal, it may be directly detected whether the voice signal includes the preset keyword, that is, the voice signals of all users in the current scene are monitored. In another optional implementation manner, before performing the detecting whether the voice signal includes a preset keyword, the method further includes: judging whether the voice signal is sent out by a specific user or not; and if the voice signal is sent by the specific user, executing the step of detecting whether the voice signal contains a preset keyword. In particular, voiceprint features can be extracted from the speech signal; matching the voiceprint features extracted from the voice signals with the pre-recorded voiceprint features of the specific user; and if the voiceprint features extracted from the voice signal are matched with the voiceprint features of the specific user, determining that the voice signal is sent by the specific user. By judging whether the voice signal is sent by the specific user before detecting whether the voice signal contains the preset keyword, the voice monitoring can be carried out on the specific user.

After receiving the voice signal, it may be directly detected whether the voice signal includes the preset keyword or whether the voice signal is sent by a specific user. In an optional implementation manner, in consideration of possible environmental noise in an environment where the smart device is located, before performing the detecting whether the voice signal includes a preset keyword or performing the determining whether the voice signal is emitted by a specific user, the method further includes: and denoising the voice signal. By denoising the voice signals, the environmental noise mixed in the voice signals can be shielded, and the accuracy of voice monitoring is improved.

The preset keywords are keywords for representing negative emotions, such as complaints, desperation, anger and other emotions, and the preset keywords are "worried", "vexed", "disappointed" and the like. The preset keywords can be configured in advance by developers, and can also be set by users according to self requirements. When the preset keyword is set by the user, before performing step S11, the user may input keyword configuration information, which includes the preset keyword, through an input device such as a key, a touch screen, or a voice capture device. By receiving the keyword configuration information, the requirement of user personalized configuration can be met.

And detecting whether the voice signal contains the preset keyword or not, wherein the preset keyword can be identified by adopting a voice awakening technology. For example, the detection may be performed by using a KWS based on template matching, and the steps of KWS training and testing based on template matching are relatively simple, where the training is to perform feature extraction and construct a template according to a registered voice or a template voice, generate a feature sequence through feature extraction, calculate a distance between the tested feature sequence and the template sequence, and determine whether to wake up based on the distance. The KWS based on the HMM-GMM can be used for detection, and the KWS based on the HMM-GMM converts the awakening task into two types of recognition tasks, wherein the recognition tasks are a keyword and a non-keyword. Neural network based schemes can be used for detection, and can be subdivided into several categories: the first type is HMM-based KWS, and acoustic model modeling is converted from GMM to neural network model; the second kind of template matching merged into the neural network, and the neural network is adopted as a feature extractor; the third category is based on end-to-end solutions, where speech is input and output is the probability of each wake-up, a model solution. The skilled person knows how to recognize the preset keyword using the above-mentioned several voice wake-up techniques, and will not be described in detail here.

When the voice signal is detected to contain the preset keyword, the display device can be directly controlled to display the preset reminding information. In an alternative implementation manner, before performing control of the display device to display the preset reminding information, considering that the user accidentally speaks the preset keyword instead of generating a negative condition, the method further includes: if the voice signal contains the preset keyword, judging whether the preset keyword is detected for the Nth time within preset time length or not, wherein N is an integer not less than 2; and if the preset keyword is detected for the Nth time within the preset time length, controlling the display device to display preset reminding information. The preset time period may be set according to actual requirements, for example, may be set to 5 minutes, 10 minutes, and the like, which is not limited in this embodiment. The monitoring accuracy can be improved by judging whether the preset keyword is detected for the Nth time within the preset time.

The preset reminding information may be text information, image information, or a combination of the text information and the image information, which is not limited in this embodiment. In addition to controlling the display device to display the preset reminding information, in an optional implementation manner, after detecting whether the voice signal includes a preset keyword, the method may further include: and if the voice signal contains the preset keyword, controlling the mobile intelligent equipment to generate a reminding action. The reminding action can be set according to actual requirements and can comprise at least one of voice prompt generation, light flicker generation or vibration generation. Taking the reminding action as an example of generating a voice prompt, the voice prompt may be "change positive. The reminding action can be configured in advance by a developer, and can also be set by a user according to the requirement of the user. When the alert action is set by the user, the action configuration information including the alert action may be input by the user through an input device such as a key, a touch screen, or a voice recognition device before step S11 is performed. By receiving the action configuration information, the requirement of user personalized configuration can be met.

In an optional implementation manner, after the detecting whether the voice signal includes a preset keyword, the method may further include: and if the voice signal contains the preset keyword, receiving a voice instruction sent by the user, and executing the voice instruction. After the user notices the preset reminding information, the user realizes that negative emotion is generated, and can control the mobile intelligent device to generate actions capable of helping the user adjust the emotion through the voice instruction. For example, after receiving the voice instruction, the mobile smart device may play a song that is favorite by the user, play a joke that is favorite by the user, or display a favorite image of the user, so as to help the user adjust the mood of the user.

According to the voice monitoring method provided by the embodiment, the voice signal sent by the user is detected, the control instruction is generated when the voice signal is detected to contain the preset keyword for representing the negative emotion, and the preset reminding information is displayed by controlling the display device, so that the user can timely perceive the negative emotion generated by the user or people around the user, the psychological health of the user can be improved, and the effect of the user in the aspect of improving the use of the negative language is effectively enhanced. The voice monitoring method and device and the mobile intelligent device provided by the invention realize the effect of helping to solve the psychological health crisis by adopting an artificial intelligence mode.

Example 2

The embodiment provides a voice monitoring device, which is applied to a mobile intelligent device provided with a display device and used for monitoring keywords for indicating that a user generates negative emotion. The mobile intelligent device can be a portable device such as a smart phone, a smart bracelet, a smart ring or a smart watch, and the embodiment of the invention does not limit the portable device. Fig. 2 is a schematic structural diagram of the voice monitoring apparatus, which includes:

a first receiving module 21, configured to receive a voice signal sent by a user;

a detection module 22, configured to detect whether the voice signal includes a preset keyword, where the preset keyword is used to represent a negative emotion;

and the control module 23 is configured to control the display device to display preset reminding information when the voice signal contains the preset keyword.

In an optional implementation manner, the first receiving module 21 is configured to receive the speech signal within a preset time period.

In an optional implementation manner, the apparatus for voice monitoring further includes:

and the denoising processing module is used for denoising the voice signal.

In an optional implementation manner, the voice monitoring apparatus further includes a second determining module;

the detection module 22 is configured to detect whether the voice signal contains a preset keyword when the voice signal is sent by the specific user.

In an optional implementation manner, the control module 13 is further configured to control the mobile smart device to generate a reminding action.

In an alternative implementation, the alert action includes at least one of generating a voice prompt, generating a light flash, or generating a vibration.

In an optional implementation manner, the voice monitoring apparatus further includes a third determining module;

the control module 23 is configured to control the display device to display a preset reminding message when the preset keyword is detected for the nth time within the preset duration.

The implementation principle of the voice monitoring apparatus can refer to the description of embodiment 1, and this embodiment is not described again.

Example 3

This embodiment provides a mobile intelligent device, where the intelligent device includes a display device, a memory, a processor, and a computer program stored in the memory and capable of running on the processor, and the processor implements the voice listening method described in embodiment 1 when executing the computer program. For convenience of explanation, only the parts related to the embodiment are shown, and details of the specific technology are not disclosed. The intelligent device can be a portable device such as a smart phone, a smart bracelet, a smart ring or a smart watch, and the embodiment of the invention does not limit the portable device. Taking the mobile intelligent device as an intelligent mobile phone as an example:

fig. 3 is a schematic diagram of a part of a structure related to the smart device provided in this embodiment. Referring to fig. 3, the smart phone includes: radio frequency circuit 310, memory 320, input unit 330, display unit 340, sensor 350, audio circuit 360, wireless fidelity (Wi-Fi) module 370, processor 380, and power supply 390. Those skilled in the art will appreciate that the smartphone configuration shown in fig. 3 is not limiting and may include more or fewer components than shown, or some components in combination, or a different arrangement of components.

The following specifically describes each component of the smart phone with reference to fig. 3:

the RF circuit 310 may be used for receiving and transmitting signals during information transmission and reception or during a call, and in particular, receives downlink information of a base station and then processes the received downlink information to the processor 380; in addition, the data for designing uplink is transmitted to the base station. In general, RF circuit 310 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low noise amplifier, a duplexer, and the like. In addition, RF circuit 310 may also communicate with networks and other devices via wireless communication. The wireless communication may use any communication standard or protocol including, but not limited to, global system for mobile communications, general packet radio service, code division multiple access, wideband code division multiple access, long term evolution, email, short message service, etc.

The memory 320 may be used to store software programs and modules, and the processor 380 executes various functional applications and data processing of the smart phone by operating the software programs and modules stored in the memory 320. The memory 320 may mainly include a program storage area and a data storage area, wherein the program storage area may store an operating system, an application program required by at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may store data (such as audio data, a phonebook, etc.) created according to the use of the smartphone, and the like. Further, the memory 320 may include high speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.

The input unit 330 may be used to receive input numeric or character information and generate key signal inputs related to user settings and function control of the smartphone. Specifically, the input unit 330 may include a touch panel 331 and other input devices 332. The touch panel 331, also referred to as a touch screen, can collect touch operations of a user (e.g., operations of the user on the touch panel 331 or near the touch panel 331 using any suitable object or accessory such as a finger, a stylus, etc.) on or near the touch panel 331, and drive the corresponding connection device according to a preset program. Alternatively, the touch panel 331 may include two parts, a touch detection device and a touch controller. The touch detection device detects the touch direction of a user, detects a signal brought by touch operation and transmits the signal to the touch controller; the touch controller receives touch information from the touch sensing device, converts the touch information into touch point coordinates, sends the touch point coordinates to the processor 380, and can receive and execute commands sent by the processor 380. In addition, the touch panel 331 may be implemented in various types, such as a resistive type, a capacitive type, an infrared ray, and a surface acoustic wave. The input unit 330 may include other input devices 332 in addition to the touch panel 331. In particular, other input devices 332 may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control keys, switch keys, etc.), a trackball, a mouse, a joystick, and the like.

The display unit 340 may be used to display information input by the user or information provided to the user and various menus of the smartphone. The display unit 340 may include a display panel 331, and optionally, the display panel 331 may be configured in the form of a liquid crystal display, an organic light emitting diode, or the like. Further, the touch panel 331 can cover the display panel 331, and when the touch panel 331 detects a touch operation on or near the touch panel 331, the touch panel is transmitted to the processor 380 to determine the type of the touch event, and then the processor 380 provides a corresponding visual output on the display panel 331 according to the type of the touch event. Although in fig. 3, the touch panel 331 and the display panel 331 are two separate components to implement the input and output functions of the smart phone, in some embodiments, the touch panel 331 and the display panel 331 may be integrated to implement the input and output functions of the smart phone.

The smartphone may also include at least one sensor 350, such as a light sensor, motion sensor, and other sensors. Specifically, the light sensor may include an ambient light sensor that may adjust the brightness of the display panel 331 according to the brightness of ambient light, and a proximity sensor that may turn off the display panel 331 and/or the backlight when the smartphone is moved to the ear. As one of the motion sensors, the accelerometer sensor can detect the magnitude of acceleration in each direction (generally, three axes), can detect the magnitude and direction of gravity when stationary, and can be used for applications (such as horizontal and vertical screen switching, related games, magnetometer attitude calibration) for recognizing the attitude of the smartphone, and related functions (such as pedometer and tapping) for vibration recognition; as for other sensors such as a gyroscope, a barometer, a hygrometer, a thermometer, and an infrared sensor, which can be configured on the smart phone, further description is omitted here.

Audio circuitry 360, speaker 361, microphone 362 may provide an audio interface between the user and the smartphone. The audio circuit 360 may transmit the electrical signal converted from the received audio data to the speaker 361, and the audio signal is converted by the speaker 361 and output; on the other hand, the microphone 362 converts the collected sound signals into electrical signals, which are received by the audio circuit 360 and converted into audio data, which are then processed by the audio data output processor 380 and then transmitted to, for example, another smart phone via the RF circuit 310, or output to the memory 320 for further processing.

WiFi belongs to short-distance wireless transmission technology, and the smart phone can help a user to receive and send e-mails, browse webpages, access streaming media and the like through the WiFi module 370, and provides wireless broadband internet access for the user. Although fig. 3 shows the WiFi module 370, it is understood that it does not belong to the essential constitution of the smartphone and may be omitted entirely as needed within the scope not changing the essence of the invention.

The processor 380 is a control center of the smart phone, connects various parts of the entire smart phone by using various interfaces and lines, and performs various functions of the smart phone and processes data by operating or executing software programs and/or modules stored in the memory 320 and calling data stored in the memory 320, thereby integrally monitoring the smart phone. Optionally, processor 380 may include one or more processing units; preferably, the processor 380 may integrate an application processor, which primarily handles operating systems, user interfaces, applications, etc., and a modem processor, which primarily handles wireless communications. It will be appreciated that the modem processor described above may not be integrated into processor 380.

The smart phone also includes a power supply 390 (e.g., a battery) for powering the various components, which may preferably be logically connected to the processor 380 via a power management system, so as to manage charging, discharging, and power consumption via the power management system.

Although not shown, the smart phone may further include a camera, a bluetooth module, and the like, which are not described herein.

Example 4

This embodiment provides a computer-readable storage medium on which a computer program is stored, and the voice monitoring method provided in embodiment 1 of the present invention may be stored in a computer-readable storage medium if it is implemented in the form of a software functional unit and sold or used as an independent product. Based on such understanding, all or part of the flow in the voice monitoring method provided in embodiment 1 may also be implemented by instructing relevant hardware through a computer program, where the computer program may be stored in a computer-readable storage medium, and when the computer program is executed by a processor, the steps of the above-mentioned method embodiments may be implemented. Wherein the computer program comprises computer program code, which may be in the form of source code, object code, an executable file or some intermediate form, etc. The computer-readable medium may include: any entity or device capable of carrying said computer program code, medium, usb disk, removable hard disk, magnetic disk, optical disk, computer Memory, Read-Only Memory (ROM), Random Access Memory (RAM), electrical carrier wave signals, telecommunications signals, software distribution medium, etc. It should be noted that the computer readable medium may contain content that is subject to appropriate increase or decrease as required by legislation and patent practice in jurisdictions, for example, in some jurisdictions, computer readable media does not include electrical carrier signals and telecommunications signals as is required by legislation and patent practice.

The above-mentioned embodiments are intended to illustrate the objects, technical solutions and advantages of the present invention in further detail, and it should be understood that the above-mentioned embodiments are merely exemplary embodiments of the present invention, and are not intended to limit the scope of the present invention, and any modifications, equivalent substitutions, improvements and the like made within the spirit and principle of the present invention should be included in the scope of the present invention.

The invention discloses a1 and a voice monitoring method, which is applied to mobile intelligent equipment provided with a display device and comprises the following steps:

receiving a voice signal sent by a user;

A2, according to the voice monitoring method in A1, the receiving the voice signal sent by the user includes:

the voice signal is received within a preset time period.

A3, according to the voice listening method of a2, before the receiving the voice signal sent by the user, the method further comprises:

A4, according to the voice monitoring method in a1, before the detecting whether the voice signal includes a preset keyword, the method further includes:

and denoising the voice signal.

A5, according to the voice monitoring method in a1, before the detecting whether the voice signal includes a preset keyword, the method further includes:

judging whether the voice signal is sent out by a specific user or not;

A6, according to the voice listening method of a1, before the receiving the voice signal sent by the user, the method further comprises:

A7, according to the voice monitoring method in a1, after the detecting whether the voice signal includes a preset keyword, the method further includes:

A8, according to the voice listening method of a7, before the receiving the voice signal sent by the user, the method further comprises:

A9, the voice monitoring method according to A7, wherein the reminding action comprises at least one of voice prompt generation, light flashing generation or vibration generation.

A10, according to the voice monitoring method in a1, after the detecting whether the voice signal includes a preset keyword, the method further includes:

A11, the method for monitoring voice according to any one of A1 to A10, further comprising before the controlling the display device to display the preset reminding information:

The invention also discloses B12, a voice monitoring device, which is applied to mobile intelligent equipment provided with a display device, and comprises:

the first receiving module is used for receiving a voice signal sent by a user;

B13, the voice monitoring device according to B12, wherein the first receiving module is configured to receive the voice signal within a preset time period.

B14, the device for monitoring voice according to B13, further comprising:

B15, the device for monitoring voice according to B12, further comprising:

and the denoising processing module is used for denoising the voice signal.

B16, the voice monitoring device according to B12, further comprising a second judgment module;

B17, the device for monitoring voice according to B12, further comprising:

B18, the voice monitoring device according to B12, the control module is further used for controlling the mobile intelligent device to generate a reminding action.

B19, the device for monitoring voice according to B18, further comprising:

B20, the audio monitor device according to B18, wherein the reminding action comprises at least one of generating an audio prompt, generating a light flash or generating a vibration.

B21, the device for monitoring voice according to B12, further comprising:

B22, the voice monitoring device according to any one of B12 to B21, further comprising a third judging module;

The invention also discloses C23 and a mobile intelligent device, which comprises a display device, a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the processor realizes the steps of the voice monitoring method from A1 to A11 when executing the computer program.

The invention also discloses D24, a computer readable storage medium, on which a computer program is stored, which, when being executed by a processor, implements the steps of the voice listening method of any one of A1 to A11.

Claims

1. A voice monitoring method is applied to mobile intelligent equipment provided with a display device, and is characterized by comprising the following steps:

receiving a voice signal sent by a user;

2. The method of claim 1, wherein the receiving the voice signal from the user comprises:

the voice signal is received within a preset time period.

3. The method of claim 2, further comprising, before the receiving the voice signal from the user:

4. The method according to claim 1, further comprising, before the detecting whether the voice signal contains a predetermined keyword, the steps of:

and denoising the voice signal.

5. The method according to claim 1, further comprising, before the detecting whether the voice signal contains a predetermined keyword, the steps of:

judging whether the voice signal is sent out by a specific user or not;

6. The method of claim 1, further comprising, before the receiving the voice signal from the user:

7. The method according to claim 1, further comprising, after the detecting whether the voice signal contains a predetermined keyword, the steps of:

8. The utility model provides a voice monitoring device, is applied to and is provided with display device's mobile intelligent equipment which characterized in that includes:

the first receiving module is used for receiving a voice signal sent by a user;

9. A mobile smart device comprising a display means, a memory, a processor and a computer program stored on said memory and executable on said processor, characterized in that said processor implements the steps of the voice listening method according to any one of claims 1 to 7 when executing said computer program.

10. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method for speech listening according to any one of claims 1 to 7.