WO2019041314A1 - Security protection method and apparatus, and smart speaker - Google Patents

Security protection method and apparatus, and smart speaker Download PDF

Info

Publication number
WO2019041314A1
WO2019041314A1 PCT/CN2017/100232 CN2017100232W WO2019041314A1 WO 2019041314 A1 WO2019041314 A1 WO 2019041314A1 CN 2017100232 W CN2017100232 W CN 2017100232W WO 2019041314 A1 WO2019041314 A1 WO 2019041314A1
Authority
WO
WIPO (PCT)
Prior art keywords
ambient sound
abnormal
sound
security protection
voice information
Prior art date
Application number
PCT/CN2017/100232
Other languages
French (fr)
Chinese (zh)
Inventor
蒋壮
张国滔
郑勇
张立新
金志军
向勇阳
卫特超
Original Assignee
深圳市沃特沃德股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市沃特沃德股份有限公司 filed Critical 深圳市沃特沃德股份有限公司
Priority to PCT/CN2017/100232 priority Critical patent/WO2019041314A1/en
Publication of WO2019041314A1 publication Critical patent/WO2019041314A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R29/00Monitoring arrangements; Testing arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones

Definitions

  • the present invention relates to the field of security technologies, and in particular, to a security protection method, device, and smart speaker.
  • the current security protection program mainly installs a camera at the target location to perform video surveillance on the target site to achieve security protection for the target site.
  • a primary object of the present invention is to provide a security protection method, apparatus and smart speaker for improving the stability and effectiveness of security protection.
  • an embodiment of the present invention provides a security protection method, where the method includes the following steps.
  • the determining whether the ambient sound is abnormal includes:
  • the determining whether the ambient sound is abnormal includes: [0013] determining whether the volume of the ambient sound is greater than or equal to a threshold;
  • the method further includes:
  • the voice information is sent out.
  • the detecting whether the voice information is included in the ambient sound comprises:
  • the sending the voice information outward comprises: sending the voice information to a user terminal by using an audio-video peer-to-peer network transmission technology.
  • the threshold has at least two, and different segments correspond to different thresholds.
  • the preset value has at least two, and different segments correspond to different preset values.
  • the determining whether the ambient sound is abnormal includes:
  • the ambient sound includes voice information ⁇ , it is determined that the ambient sound is abnormal.
  • the method further includes:
  • the step of collecting an ambient sound further includes:
  • the listening mode is started, and the next step is taken: the ambient sound is collected.
  • Embodiments of the present invention also provide a security protection device, and the device includes:
  • a sound collection module configured to collect an ambient sound
  • the abnormality determining module is configured to determine whether the ambient sound is abnormal
  • the abnormality alarm module is configured to send an alarm message outward when the ambient sound is abnormal.
  • the abnormality determining module includes:
  • the first determining unit is configured to determine whether the volume of the ambient sound is greater than or equal to a threshold
  • a first determining unit configured to: when the volume of the ambient sound is greater than or equal to the threshold ⁇ , The ambient sound is abnormal.
  • the abnormality determining module includes:
  • the first determining unit is configured to determine whether the volume of the ambient sound is greater than or equal to a threshold
  • the second determining unit is configured to: when the volume of the ambient sound is greater than or equal to the threshold ⁇ , determine whether a volume of consecutive N sampling points of the ambient sound is greater than or equal to a preset value, where N Greater than or equal to 2;
  • the second determining unit is configured to determine that the ambient sound is abnormal when the volume of the sampling points having consecutive N of the ambient sounds is greater than or equal to a preset value ⁇ .
  • the device further includes:
  • a voice detection module configured to detect whether the ambient sound includes a linguistic first when the ambient sound is abnormally ⁇ ;
  • the voice sending module is configured to: when the ambient sound includes voice information, send the voice to the outside.
  • the voice detection module is configured to: perform a domain and frequency domain feature analysis on the ambient sound by using a voice activity detection algorithm, and determine whether the voice information is included in the environment sound.
  • the voice sending module is configured to: send the voice information to the user terminal by using an audio-video peer-to-peer network transmission technology.
  • the abnormality determining module includes:
  • a third determining unit configured to determine whether voice information is included in the ambient sound
  • the third determining unit is configured to determine that the environmental sound meter is suspended when the ambient sound includes voice information.
  • the device further includes a voice sending module, configured to: when the ambient sound is abnormal, send the voice information outward.
  • a voice sending module configured to: when the ambient sound is abnormal, send the voice information outward.
  • the device further includes a monitoring and starting module, configured to: when receiving the monitoring command, start the listening mode to trigger the sound collecting module to collect the ambient sound.
  • a monitoring and starting module configured to: when receiving the monitoring command, start the listening mode to trigger the sound collecting module to collect the ambient sound.
  • Embodiments of the present invention also provide a smart speaker that includes a memory, a processor, and at least one application stored in the memory and configured to be executed by the processor, the application being configured It is used to implement the aforementioned security protection method.
  • a security protection method provided by an embodiment of the present invention by monitoring an environmental sound, when an environmental sound is abnormal, sending an alarm message, thereby realizing remote monitoring of a target location such as a home or an office, and realizing the target location.
  • the safety protection has improved the security level of the target site.
  • the remote monitoring protection scheme of the embodiment of the present invention does not expose the image of the target location, so the user privacy can be better protected, and the remote monitoring can be realized by using the smart home device such as the existing smart speaker.
  • the cost is low, the concealment is good, it is not easy to be discovered and destroyed by the intruder, and the monitoring effect is affected by the lens occlusion, illumination and the like, thereby greatly improving the stability and effectiveness of the safety protection.
  • FIG. 1 is a flow chart of a first embodiment of a security protection method of the present invention
  • FIG. 2 is a flow chart of a second embodiment of the security protection method of the present invention.
  • FIG. 3 is a schematic block diagram of a first embodiment of the safety protection device of the present invention.
  • FIG. 4 is a block diagram of the abnormality determining module of FIG. 3;
  • FIG. 5 is another block diagram of the abnormality determining module of FIG. 3;
  • FIG. 6 is a schematic block diagram of a second embodiment of the safety protection device of the present invention.
  • FIG. 7 is a schematic block diagram of a third embodiment of the safety protection device of the present invention.
  • FIG. 8 is a block diagram of the abnormality determining module of FIG. 7.
  • terminal and terminal device used herein include both a device of a wireless signal receiver, a device having only a wireless signal receiver without a transmitting capability, and a receiving and receiving device.
  • Such a device may comprise: a cellular or other communication device having a single line display or a multi-line display or a cellular or other communication device without a multi-line display; PCS (Persona 1 Communications Service), which may combine voice, Data processing, fax and/or data communication capabilities; PDA (Personal Digital Assistant), which can include radio frequency receivers, pagers, Internet/Intranet access, web browsers, notepads, calendars and/or GPS ( Global Positioning System, Receiver; Conventional laptop and/or palmtop computer or other device having a conventional laptop and/or palmtop computer or other device that includes and/or includes a radio frequency receiver.
  • PCS Personala 1 Communications Service
  • PDA Personal Digital Assistant
  • terminal may be portable, transportable, installed in a vehicle (aviation, sea and/or land), or adapted and/or configured to operate locally, and/or Run in any other location on the Earth and/or space in a distributed fashion.
  • the "terminal” and “terminal device” used may also be a communication terminal, an internet terminal, a music/video playing terminal, and may be, for example, a PDA, a MID (Mobile Internet Device), and/or a music/video playing function.
  • Mobile phones can also be smart TVs, set-top boxes and other devices.
  • the server used herein includes, but is not limited to, a computer, a network host, a single network server, a plurality of network server sets, or a plurality of servers.
  • the cloud consists of a large number of computers or network servers based on Cloud Computing, which is a kind of distributed computing, a super virtual computer composed of a group of loosely coupled computers.
  • communication may be implemented by any communication means between the server, the terminal device and the WNS server, including but not limited to, mobile communication based on 3GPP, LTE, WIMAX, and computer network communication based on TCP/IP and UDP protocols. And short-range wireless transmission based on Bluetooth and infrared transmission standards.
  • the security protection method and the security protection device of the embodiments of the present invention are mainly applied to smart home devices such as smart speakers and smart televisions, and can also be applied to other terminal devices, which is not limited by the present invention.
  • smart home devices such as smart speakers and smart televisions
  • other terminal devices which is not limited by the present invention.
  • the following is a detailed description of the application to the smart speaker.
  • a first embodiment of a security protection method according to the present invention is provided.
  • the method includes the following steps:
  • the ambient sound of the target place is collected by the microphone at a certain sampling frequency.
  • the sampling frequency can be set according to actual needs, such as setting to 16KHZ or higher.
  • the microphone of the smart speaker preferably includes a plurality of microphones and constitutes a microphone array.
  • the intelligent sound box collects sound signals from the environment through the microphone array's pickups, and transmits them to the digital signal processor (DSP) through the audio interface to sample and quantify the audio signals.
  • DSP digital signal processor
  • the user can remotely control the smart speaker to enable the monitoring function.
  • the user sends a monitoring instruction to the smart speaker through a user terminal such as a mobile phone, a tablet, a personal computer, etc.
  • the intelligent speaker starts the listening mode, and proceeds to step S1 l to start or determine Collect ambient sounds.
  • the user can manually activate the monitoring function of the smart speaker when leaving the home, and when the monitoring function is turned on, the smart speaker starts the listening mode.
  • the user can set the monitoring segment, and when entering the monitoring segment, the smart speaker automatically starts monitoring. Mode, when the monitor is off, the smart speaker automatically turns off the monitor mode.
  • step S12. Determine whether the ambient sound is abnormal. When the ambient sound is abnormal, proceed to step S13, otherwise continue to monitor the ambient sound.
  • step S12 the smart speaker analyzes and processes the sampled ambient sound to determine whether the ambient sound is hoisted.
  • the smart speaker determines whether the volume of the ambient sound is greater than or equal to a threshold, and determines that the ambient sound is abnormal when the volume of the ambient sound is greater than or equal to the threshold.
  • the smart speaker can average the volume of the sampling points of the plurality of ambient sounds collected by the microphone array (such as an arithmetic mean), and then compare the average value with a preset threshold value, when the average If the value is greater than or equal to the threshold ⁇ , the volume of the ambient sound is greater than or equal to the threshold, and the ambient snoring is determined.
  • a preset threshold value such as an arithmetic mean
  • the smart speaker first determines whether the volume of the ambient sound is greater than or equal to a threshold, and when the volume of the ambient sound is greater than or equal to the threshold, determining whether there is continuous ⁇ ( ⁇ >2) ambient sounds.
  • the volume of the sampling point is greater than or equal to the preset value. If yes, it is determined that the ambient sound is abnormal, otherwise the ambient sound is determined to be normal. This way makes the judgment more accurate and prevents misjudgment.
  • the smart speaker may first average the volume of the sampling points of the plurality of ambient sounds collected by the microphone array (such as an arithmetic mean), and then compare the average value with a preset threshold value. If the average value is greater than or equal to the threshold value, the volume of the ambient sound is greater than or equal to the threshold value, and then it is determined whether the volume of the sampling point of the continuous ambient sound is greater than or equal to the preset value, and if so, the ambient sound is determined to be abnormal, otherwise It is determined that the ambient sound is normal.
  • It can be set according to actual needs, for example, it is set within the range of 5-10.
  • the foregoing threshold may be one, or at least two, that is, each segment corresponds to one threshold.
  • the preset value may be one or at least two, that is, each segment corresponds to a preset value.
  • the threshold value and the preset value may be set according to the volume statistics of the ambient sound in daily life, that is, the normal volume value of the ambient sound in daily life is counted, and then the normal volume value is set as a threshold or a preset value, or A buffer value is added as a threshold or a preset value based on the normal volume value.
  • the threshold and preset values may or may not be equal.
  • the smart speaker statistical microphone array is collected in a daytime period (eg, 10:00-23:59).
  • the arithmetic mean of the volume of the ambient sound is used as the normal volume value of the daytime segment, and the arithmetic mean of the volume of the ambient sound collected during the nighttime segment (eg 0:00-9:59) is used as the normal volume value of the nighttime segment.
  • the same or different buffer values are respectively added as the threshold value and the preset value of the daytime segment, and the same or different values are respectively added on the basis of the normal volume value of the nighttime segment.
  • the buffer value is used as the threshold and preset value for the nighttime segment.
  • the normal volume value of the daytime segment is generally in the range of 40db-50db, and the normal volume value of the nighttime segment is generally in the range of 30db-40db. Further, the aforementioned normal volume value may be periodically updated to adapt to the current indoor noise environment.
  • the smart speaker when the ambient sound is abnormal, the smart speaker immediately sends an alarm message to the server, and after receiving the alarm information, the server immediately pushes the alarm information to the designated user terminal.
  • the smart speaker can also send an alarm message directly to the user terminal.
  • the user terminal may send a voice message and/or display graphic information to remind the user that the user terminal may be a mobile terminal such as a mobile phone or a tablet, or may be a computer terminal such as a personal computer or a notebook computer.
  • the user can take security measures to avoid or reduce losses.
  • the intelligent speaker can also sound an alarm to take appropriate safety measures to shock the entrant or attract the attention of nearby residents.
  • step S12 when it is determined in step S12 that the ambient sound is abnormal, the following steps are further included:
  • step S14 Detect whether voice information is included in the ambient sound. When the voice information is included in the ambient sound, the process proceeds to step S15; when the voice message is not included in the environment sound, the process ends.
  • the intelligent speaker preferably utilizes a linear superposition principle of a voice signal and a non-speech audio signal, and uses a voice activity detection algorithm (VAD) to perform spatial and frequency domain feature analysis on the ambient sound to determine the environment. Whether the voice contains voice information.
  • VAD voice activity detection algorithm
  • the intelligent speaker processes the collected ambient sound according to the frame, and the length of each frame is set according to the characteristics of the collected sound signal, and the parameter feature value of each frame of the sound signal is extracted by the voice activity detection algorithm, and the parameter feature value is compared.
  • the size of the threshold When the parameter characteristic value is greater than or equal to the threshold value ⁇ , the frame is determined to be a speech frame; when the parameter value is less than the threshold value ⁇ , it is determined that the frame is not a speech frame and is noise.
  • Voice ⁇ The main features of the domain and frequency domain are as follows:
  • the speech frame has a relatively high short-lived energy, and the short-twist energy feature is easy to extract, and in the lower noise environment, there is obvious recognition performance. Since the background sound is additive noise to the speech, the short burst energy should be different when the person starts to speak and does not start talking.
  • the pretreatment step is required before extracting the short energy characteristics.
  • the energy characteristics of the speech signal also include that the energy of the speech is distributed in all frequency bands, and is more prominent around the pitch frequency and the first two formant frequencies. At the same time, different types of noise also have different effects in various frequency bands. Some frequency energy distributions are concentrated. For example, car noise is mainly distributed in the low frequency band.
  • the sub-band energy extraction ⁇ can divide the full frequency band from low frequency to high frequency into four sub-bands, respectively 250-2000 Hz, 2000 Hz to 4000 Hz, 4000 Hz.
  • the short-cut zero-crossing rate reflects the number of times the voice signal's waveform in one frame passes through the horizontal axis representing the zero level. For the sampled signal, if the adjacent sample point amplitude changes the sign, it is called zero-crossing, and the number of times the symbol is changed within one frame is called the zero-crossing rate of the frame. In terms of speech characteristics, the voiced audio rate is lower, with a lower average zero-crossing rate, about 14/lOms; the clear audio frequency is higher, with a higher average zero-crossing rate, about 47/10ms. The noise or silence zero-crossing rate is between the unvoiced and voiced sounds or less than the voiced sound. Therefore, the short zero crossing rate is a significant feature for detecting voiced frames and is not affected by power and amplitude. Therefore, the combination of short-cut zero-crossing rate and short-twist energy is a relatively effective detection algorithm.
  • step S15 when it is detected that the ambient sound includes voice information, the smart speaker extracts the voice information from the ambient sound, and immediately sends the voice information to the user terminal or the server, and the server immediately receives the voice information, and immediately Push voice information to the specified user terminal.
  • the intelligent speaker can use the audio-video peer-to-peer network transmission technology (or the audio and video P2P transmission technology) to directly send the voice information to the user terminal, so that the voice information is not obtained by a third party such as a server, but only acquired by the user end, and the voice information is improved. Information security.
  • the user terminal may prompt the user to play the voice information or directly play the voice information.
  • the user terminal may be a mobile terminal such as a mobile phone or a tablet, or may be Computer terminals such as personal computers and laptops. Therefore, the user can further understand the specific situation of the scene according to the voice information, determine whether the family actually has an intrusion, improve the accuracy of the judgment, and prevent misjudgment.
  • the user terminal can send an alarm signal to the smart speaker through the user terminal, and the smart speaker receives the alarm signal, and then sounds an alarm to shock. Intruders or the attention of nearby residents to take appropriate security measures.
  • a specific application may be installed on a user terminal, and the user terminal communicates with the smart speaker through the specific application, for example, the user terminal sends a monitoring instruction to the smart speaker through a specific application, by using a specific The application receives alarm information, voice information, etc. sent by the smart speaker, sends an alarm signal to the smart speaker through a specific application, and the like.
  • the smart speaker can also determine whether the ambient sound is abnormal by: determining whether the ambient sound contains voice information, and when the ambient sound includes voice information, determining that the ambient sound is abnormal.
  • the smart speaker when it is determined that the ambient sound is abnormal, the smart speaker further extracts the voice information from the ambient sound and transmits the voice information outward.
  • the smart speaker For the specific sending manner, refer to the foregoing embodiment, and details are not described herein again.
  • the foregoing embodiments may also combine the manner in which the ambient sound is determined to be abnormal. For example, it is preferred to determine whether the volume of the ambient sound is greater than or equal to the threshold. If yes, determine whether the ambient sound contains voice information, and when the ambient sound includes voice information, determine that the ambient sound is abnormal. For example, it is preferred to determine whether the volume of the ambient sound is greater than or equal to the threshold; if yes, determine whether the volume of the sampling points of the continuous N ambient sounds is greater than or equal to the preset value, and if so, whether the ambient sound contains the voice information. When the voice information is included in the ambient sound, it is determined that the ambient sound is abnormal.
  • the security protection method of the embodiment of the present invention by monitoring the ambient sound, sends an alarm message when the ambient sound is abnormal, thereby realizing remote monitoring of a target location such as a home or an office, thereby realizing security protection to the target location.
  • a target location such as a home or an office
  • security protection to the target location improve the security level of the target location.
  • the remote monitoring protection scheme of the embodiment of the present invention does not expose the image of the target location, and thus can be better protected.
  • peers can use the smart home devices such as existing smart speakers to achieve remote monitoring, low cost, good concealment, not easy to be destroyed by intruders, and avoid image blocking, lighting and other issues affecting the monitoring effect, so Greatly improve the stability and effectiveness of security protection.
  • the device includes a sound collection module 10, an abnormality determination module 20, and an abnormality alarm module 30, wherein: the sound collection module 10 is configured to collect ambient sounds.
  • the abnormality determining module 20 is configured to determine whether the ambient sound is abnormal; the abnormality alarm module 30 is set to send an alarm message outward when the ambient sound is abnormal.
  • the sound collecting module 10 collects the ambient sound of the target place through the microphone at a certain sampling frequency.
  • the sampling frequency can be set according to actual needs, such as 16KHZ or higher.
  • the microphone of the smart speaker preferably includes a plurality of microphones and constitutes a microphone array.
  • the user can remotely control the smart speaker to enable the monitoring function.
  • the security protection device further includes a monitoring startup module, and the user sends a monitoring instruction to the smart speaker through a user terminal such as a mobile phone, a tablet, a personal computer, etc., after the intelligent speaker receives the monitoring instruction, the monitoring startup module starts the monitoring mode to trigger the sound collection module.
  • the sound of the collection environment is calculated or fixed at 10 o'clock.
  • the user can manually activate the monitoring function of the smart speaker when leaving the home.
  • the smart speaker starts the listening mode by monitoring the startup module.
  • the user can set the monitoring section, and when entering the monitoring section, the intelligent speaker automatically starts the monitoring mode by monitoring the startup module, and when the monitoring section is off, the intelligent speaker automatically turns off the listening mode by monitoring the shutdown module. .
  • the abnormality determining module 20 analyzes and processes the ambient sound sampled by the sound collecting module 10 to determine whether the ambient sound is abnormal.
  • the abnormality determining module 20 includes a first determining unit 21 and a first determining unit 22, as shown in FIG. 4, wherein: the first determining unit 21 is configured to determine whether the volume of the ambient sound is greater than or Equal to the threshold; the first determining unit 22 is configured to determine that the ambient sound is abnormal when the volume of the ambient sound is greater than or equal to the threshold ⁇ .
  • the first determining unit 21 obtains an average value (such as an arithmetic mean value) of the sampling points of the plurality of ambient sounds collected by the microphone array, and then compares the average value with a preset threshold value, when If the average value is greater than or equal to the threshold ⁇ , then the volume of the ambient sound is greater than or equal to the threshold. When the volume of the ambient sound is greater than or equal to the threshold ⁇ , the first decision unit 22 determines that the ambient sound is abnormal, otherwise it determines that the ambient sound is normal.
  • an average value such as an arithmetic mean value
  • the abnormality determining module 20 includes a first determining unit 21, a second determining unit 23, and a second determining unit 24, as shown in FIG. 5, wherein: the first determining unit 21 is configured to determine Whether the volume of the ambient sound is greater than or equal to the threshold; the second determining unit 23 is configured to determine whether the volume of the sampling points of consecutive N (N>2) ambient sounds is greater than or equal to the threshold value ⁇ The second determining unit 24 is configured to determine that the ambient sound is abnormal when the volume of the sampling point with consecutive N ambient sounds is greater than or equal to the preset value.
  • the first determining unit 21 obtains an average value (such as an arithmetic mean value) of the sampling points of the plurality of ambient sounds collected by the microphone array, and then compares the average value with a preset threshold value. When the average value is greater than or equal to the threshold ⁇ , the volume of the ambient sound is greater than or equal to the threshold. When the volume of the ambient sound is greater than or equal to the threshold ⁇ , the second determining unit 23 determines whether the volume of the sampling points of the consecutive N ambient sounds is greater than or equal to the preset value.
  • an average value such as an arithmetic mean value
  • N can be set according to actual needs, for example, set within the range of 5-10.
  • the foregoing threshold may be one or at least two, that is, each segment corresponds to one threshold.
  • the preset value may be one or at least two, that is, each segment corresponds to a preset value.
  • the threshold value and the preset value may be set according to the volume statistics of the ambient sound in daily life, that is, the normal volume value of the ambient sound in daily life is counted, and then the normal volume value is set as a threshold or a preset value, or A buffer value is added as a threshold or a preset value based on the normal volume value.
  • the threshold and preset values may or may not be equal.
  • the abnormality alarm module 30 When the ambient sound is abnormal, the abnormality alarm module 30 immediately sends an alarm message to the server, and after receiving the alarm information, the server immediately pushes the alarm information to the designated user terminal.
  • the abnormality alarm module 30 can also directly send alarm information to the user terminal.
  • the user terminal After receiving the alarm information, the user terminal may send a voice message and/or display graphic information to remind the user that the user terminal may be a mobile terminal such as a mobile phone or a tablet, or may be a computer terminal such as a personal computer or a notebook computer.
  • the abnormality alarm module 30 can also sound an alarm to take appropriate safety measures to shock the entrant or attract the attention of nearby residents.
  • the device further includes a voice detection module 40 and a voice sending module 50, where: the voice detection module 40 is set to be an environment The voice is abnormally ⁇ , detecting whether the voice information is included in the ambient sound; and the voice sending module 50 is configured to send the voice information to the outside when the voice voice is included in the ambient voice.
  • the intelligent speaker preferably utilizes a linear superposition principle of a voice signal and a non-speech audio signal, and uses a voice activity detection algorithm (VAD) to perform spatial and frequency domain feature analysis on the ambient sound to determine whether the ambient sound is included. voice message.
  • VAD voice activity detection algorithm
  • the voice detection module 40 processes the collected ambient sound according to the frame, and the length of each frame is set according to the characteristics of the collected sound signal, and the parameter feature value of each frame of the sound signal is extracted by the voice activity detection algorithm, and the parameter is compared. The size of the eigenvalue and threshold.
  • the parameter characteristic value is greater than or equal to the threshold value ⁇ , it is determined that the frame is a speech frame; when the parameter value is less than the threshold value ⁇ , it is determined that the frame is not a speech frame and is noise.
  • the voice sending module 50 extracts the voice information from the ambient sound, and immediately sends the voice information to the user terminal or the server, and after receiving the voice information, the server immediately transmits the voice information.
  • the information is pushed to the specified user terminal.
  • the voice sending module 50 uses the audio-video peer-to-peer network transmission technology (or the audio and video P2P transmission technology) to directly send voice information to the user terminal, so that the voice information is not obtained by a third party such as a server, but is only obtained by the user terminal, thereby improving The security of the information.
  • the user terminal may prompt the user to play the voice information or directly play the voice information.
  • the user terminal may be a mobile terminal such as a mobile phone or a tablet, or may be a computer terminal such as a personal computer or a notebook computer. Therefore, the user can further understand the specific situation on the spot according to the voice information, determine whether the family actually has an intrusion, improve the accuracy of the judgment, and prevent misjudgment.
  • the user terminal may send an alarm signal to the smart speaker, and when the abnormal alarm module 30 receives the alarm signal, an alarm sound is generated. Take appropriate safety measures to shock the entrants or attract the attention of nearby residents.
  • FIG. 8 a third embodiment of the safety guard of the present invention is presented.
  • the abnormality determining module 2 of this embodiment As shown in FIG. 8, the third determining unit 25 and the third determining unit 26 are included, wherein: the third determining unit 25 is configured to determine whether voice information is included in the ambient sound; and the third determining unit 26 is configured to be an ambient sound.
  • the voice information is included, and the ambient sound is abnormal.
  • the third judging unit 25 analyzes whether the voice information is included in the environment sound, and the manner of analyzing and judging by the voice detecting module 40 in the foregoing embodiment is the same, and details are not described herein again.
  • the apparatus further includes a voice transmitting module 50 configured to send the voice information outward when the ambient sound is abnormal. Therefore, the user can further understand the specific situation on the spot according to the voice information, determine whether the family actually has an intrusion, improve the accuracy of the judgment, and prevent misjudgment.
  • the abnormality determination module 20 may also combine the manner in which the foregoing embodiment determines whether the ambient sound is abnormal. For example, the abnormality determining module 20 first determines whether the volume of the ambient sound is greater than or equal to the threshold. If yes, it determines whether the ambient sound contains voice information, and when the ambient sound includes the voice information, it determines that the ambient sound is abnormal. For example, the abnormality determining module 20 first determines whether the volume of the ambient sound is greater than or equal to the threshold; if yes, determining whether the volume of the sampling points of the consecutive N ambient sounds is greater than or equal to the preset value, and if so, determining the ambient sound. Whether or not the voice information is included, and when the voice information is included in the ambient sound, it is determined that the environmental sound is abnormal.
  • the security protection device of the embodiment of the present invention transmits an alarm message when the ambient sound is abnormal, by monitoring the ambient sound, thereby realizing remote monitoring of a target location such as a home or an office, thereby realizing security protection against the target site. Improve the security level of the target location.
  • the remote monitoring protection scheme of the embodiment of the present invention does not expose the image of the target location, so the user privacy can be better protected, and the remote monitoring can be realized by using the smart home device such as the existing smart speaker.
  • the cost is low, the concealment is good, it is not easy to be destroyed by the intruder, and the image occlusion, illumination and the like are avoided, which affects the monitoring effect, thereby greatly improving the stability and effectiveness of the security protection.
  • the present invention also provides a smart speaker that includes a memory, a processor, and at least one application stored in the memory and configured to be executed by the processor, the application being configured to perform security protection method.
  • the security protection method includes the following steps: collecting an ambient sound, determining whether the ambient sound is abnormal, and sending an alarm message when the ambient sound is abnormal. Security described in this embodiment The protection method is the security protection method in the foregoing embodiment of the present invention, and details are not described herein again.
  • the present invention includes apparatus that is directed to performing one or more of the operations described herein.
  • These devices may be specially designed and manufactured for the required purposes, or may also include known devices in a general purpose computer.
  • These devices have computer programs stored therein that are selectively activated or reconfigured.
  • Such computer programs may be stored in a device (eg, computer) readable medium or in any type of medium suitable for storing electronic instructions and respectively coupled to a bus, including but not limited to any Types of disks (including floppy disks, hard disks, CDs, CD-ROMs, and magneto-optical disks), ROM (Read-Only Memory), RAM (Random Access Memory), EPROM (Erasable Programmable Read-Only)
  • a readable medium includes any medium that is stored or transmitted by a device (e.g., a computer) in a readable form.
  • each block of the block diagrams and/or block diagrams and/or flow diagrams can be implemented by computer program instructions, and/or in the block diagrams and/or block diagrams and/or flow diagrams. The combination of boxes.
  • these computer program instructions can be implemented by a general purpose computer, a professional computer, or a processor of other programmable data processing methods, such that the processor is executed by a computer or other programmable data processing method.
  • the block diagrams and/or block diagrams of the invention and/or the schemes specified in the blocks or blocks of the flow diagram are invented.

Abstract

The present invention discloses a security protection method and apparatus, and a smart speaker. The method comprises the following steps: acquiring an ambient sound; determining whether the ambient sound is abnormal; and transmitting alarm information externally if the ambient sound is abnormal. Thereby, a target site such as a home or an office can be remotely monitored, realizing security protection of the target site, and raising the security level of the target site. Compared with video surveillance protection schemes, the remote monitoring protection scheme of embodiments of the present invention does not reveal images of the target site, thereby better ensuring user privacy. Further, remote monitoring can be realized by using existing smart home devices such as smart speakers to reduce costs, while concealing such devices renders them difficult to be found and damaged by intruders, and prevents them from suffering lens occlusion, illumination, and other issues, thus greatly improving stability and effectiveness of security protection.

Description

安全防护方法、 装置和智能音箱 技术领域  Safety protection method, device and intelligent speaker
[0001] 本发明涉及安保技术领域, 特别是涉及到一种安全防护方法、 装置和智能音箱 背景技术  [0001] The present invention relates to the field of security technologies, and in particular, to a security protection method, device, and smart speaker.
[0002] 为了防止外来人员秘密闯入重要的目标场所, 需要对目标场所进行安全防护。  [0002] In order to prevent outsiders from secretly entering an important target location, it is necessary to provide security protection to the target site.
目前的安全防护方案, 主要是在目标场所安装摄像头, 以对目标场所进行视频 监控, 从而实现对目标场所的安全防护。  The current security protection program mainly installs a camera at the target location to perform video surveillance on the target site to achieve security protection for the target site.
技术问题  technical problem
[0003] 然而, 摄像头传输的图像如果被第三方获取, 则会暴露用户隐私, 本身存在安 全隐患; 同吋, 摄像头一般都比较显眼, 很容易被发现, 因此极易被闯入者破 坏; 而且, 摄像头还会因为镜头遮挡、 光照等问题而影响监控效果。 凡此种种 , 都影响了安全防护的稳定性和有效性。  [0003] However, if the image transmitted by the camera is obtained by a third party, the user's privacy will be exposed, and there is a security risk in itself; at the same time, the camera is generally conspicuous and easily found, so it is easily destroyed by the intruder; The camera will also affect the monitoring effect due to problems such as lens occlusion and illumination. All of these factors affect the stability and effectiveness of safety protection.
问题的解决方案  Problem solution
技术解决方案  Technical solution
[0004] 本发明的主要目的为提供一种安全防护方法、 装置和智能音箱, 旨在提高安全 防护的稳定性和有效性。  [0004] A primary object of the present invention is to provide a security protection method, apparatus and smart speaker for improving the stability and effectiveness of security protection.
[0005] 为达以上目的, 本发明实施例提出一种安全防护方法, 所述方法包括以下步骤 [0005] In order to achieve the above objective, an embodiment of the present invention provides a security protection method, where the method includes the following steps.
[0006] 采集环境声音; [0006] collecting ambient sounds;
[0007] 判断所述环境声音是否异常;  [0007] determining whether the ambient sound is abnormal;
[0008] 当所述环境声音异常吋, 向外发送报警信息。  [0008] When the ambient sound is abnormal, the alarm information is sent out.
[0009] 可选地, 所述判断所述环境声音是否异常包括:  [0009] Optionally, the determining whether the ambient sound is abnormal includes:
[0010] 判断所述环境声音的音量是否大于或等于阈值;  [0010] determining whether the volume of the ambient sound is greater than or equal to a threshold;
[0011] 当所述环境声音的音量大于或等于所述阈值吋, 判定所述环境声音异常。  [0011] When the volume of the ambient sound is greater than or equal to the threshold 吋, it is determined that the ambient sound is abnormal.
[0012] 可选地, 所述判断所述环境声音是否异常包括: [0013] 判断所述环境声音的音量是否大于或等于阈值; [0012] Optionally, the determining whether the ambient sound is abnormal includes: [0013] determining whether the volume of the ambient sound is greater than or equal to a threshold;
[0014] 当所述环境声音的音量大于或等于所述阈值吋, 判断是否有连续 N个所述环境 声音的采样点的音量均大于或等于预设值, 其中 N大于或等于 2;  [0014] when the volume of the ambient sound is greater than or equal to the threshold 吋, determining whether there is a continuous N of the ambient sound sampling point volume is greater than or equal to a preset value, wherein N is greater than or equal to 2;
[0015] 若是, 则判定所述环境声音异常。  [0015] If yes, it is determined that the ambient sound is abnormal.
[0016] 可选地, 所述判断所述环境声音是否异常的步骤之后还包括:  [0016] Optionally, after the step of determining whether the ambient sound is abnormal, the method further includes:
[0017] 当所述环境声音异常吋, 检测所述环境声音中是否包含语音信息;  [0017] detecting abnormality in the ambient sound, whether the voice information is included in the ambient sound;
[0018] 当所述环境声音中包含语音信息吋, 向外发送所述语音信息。  [0018] when the ambient sound includes voice information, the voice information is sent out.
[0019] 可选地, 所述检测所述环境声音中是否包含语音信息包括:  [0019] Optionally, the detecting whether the voice information is included in the ambient sound comprises:
[0020] 采用语音活动检测算法对所述环境声音进行吋域和频域特征分析, 判断所述环 境声音中是否包含语音信息。  [0020] Performing a domain and frequency domain feature analysis on the ambient sound by using a voice activity detection algorithm to determine whether voice information is included in the ambient sound.
[0021] 可选地, 所述向外发送所述语音信息包括: 采用音视频对等网络传输技术向用 户终端发送所述语音信息。  [0021] Optionally, the sending the voice information outward comprises: sending the voice information to a user terminal by using an audio-video peer-to-peer network transmission technology.
[0022] 可选地, 所述阈值至少有两个, 不同的吋段对应不同的阈值。  [0022] Optionally, the threshold has at least two, and different segments correspond to different thresholds.
[0023] 可选地, 所述预设值至少有两个, 不同的吋段对应不同的预设值。  [0023] Optionally, the preset value has at least two, and different segments correspond to different preset values.
[0024] 可选地, 所述判断所述环境声音是否异常包括:  [0024] Optionally, the determining whether the ambient sound is abnormal includes:
[0025] 判断所述环境声音中是否包含语音信息;  [0025] determining whether voice information is included in the ambient sound;
[0026] 当所述环境声音中包含语音信息吋, 判定所述环境声音异常。  [0026] When the ambient sound includes voice information 吋, it is determined that the ambient sound is abnormal.
[0027] 可选地, 所述判断所述环境声音是否异常的步骤之后还包括:  [0027] Optionally, after the step of determining whether the ambient sound is abnormal, the method further includes:
[0028] 当所述环境声音异常吋, 向外发送所述语音信息。  [0028] When the ambient sound is abnormal, the voice information is sent out.
[0029] 可选地, 所述采集环境声音的步骤之前还包括:  [0029] Optionally, the step of collecting an ambient sound further includes:
[0030] 当接收到监听指令吋, 启动监听模式, 并进入下一步骤: 采集环境声音。  [0030] When the listen command is received, the listening mode is started, and the next step is taken: the ambient sound is collected.
[0031] 本发明实施例同吋提出一种安全防护装置, 所述装置包括:  [0031] Embodiments of the present invention also provide a security protection device, and the device includes:
[0032] 声音采集模块, 设置为采集环境声音;  [0032] a sound collection module, configured to collect an ambient sound;
[0033] 异常判断模块, 设置为判断所述环境声音是否异常;  [0033] the abnormality determining module is configured to determine whether the ambient sound is abnormal;
[0034] 异常报警模块, 设置为当所述环境声音异常吋, 向外发送报警信息。  [0034] The abnormality alarm module is configured to send an alarm message outward when the ambient sound is abnormal.
[0035] 可选地, 所述异常判断模块包括:  [0035] Optionally, the abnormality determining module includes:
[0036] 第一判断单元, 设置为判断所述环境声音的音量是否大于或等于阈值;  [0036] The first determining unit is configured to determine whether the volume of the ambient sound is greater than or equal to a threshold;
[0037] 第一判决单元, 设置为当所述环境声音的音量大于或等于所述阈值吋, 判定所 述环境声音异常。 [0037] a first determining unit, configured to: when the volume of the ambient sound is greater than or equal to the threshold 吋, The ambient sound is abnormal.
[0038] 可选地, 所述异常判断模块包括:  [0038] Optionally, the abnormality determining module includes:
[0039] 第一判断单元, 设置为判断所述环境声音的音量是否大于或等于阈值;  [0039] The first determining unit is configured to determine whether the volume of the ambient sound is greater than or equal to a threshold;
[0040] 第二判断单元, 设置为当所述环境声音的音量大于或等于所述阈值吋, 判断是 否有连续 N个所述环境声音的采样点的音量均大于或等于预设值, 其中 N大于或 等于 2;  [0040] The second determining unit is configured to: when the volume of the ambient sound is greater than or equal to the threshold 吋, determine whether a volume of consecutive N sampling points of the ambient sound is greater than or equal to a preset value, where N Greater than or equal to 2;
[0041] 第二判决单元, 设置为当有连续 N个所述环境声音的采样点的音量均大于或等 于预设值吋, 判定所述环境声音异常。  [0041] The second determining unit is configured to determine that the ambient sound is abnormal when the volume of the sampling points having consecutive N of the ambient sounds is greater than or equal to a preset value 。.
[0042] 可选地, 所述装置还包括: [0042] Optionally, the device further includes:
[0043] 语音检测模块, 设置为当所述环境声音异常吋, 检测所述环境声音中是否包含 语首 息;  [0043] a voice detection module, configured to detect whether the ambient sound includes a linguistic first when the ambient sound is abnormally 吋;
[0044] 语音发送模块, 设置为当所述环境声音中包含语音信息吋, 向外发送所述语音 f π息。  [0044] The voice sending module is configured to: when the ambient sound includes voice information, send the voice to the outside.
[0045] 可选地, 所述语音检测模块设置为: 采用语音活动检测算法对所述环境声音进 行吋域和频域特征分析, 判断所述环境声音中是否包含语音信息。  [0045] Optionally, the voice detection module is configured to: perform a domain and frequency domain feature analysis on the ambient sound by using a voice activity detection algorithm, and determine whether the voice information is included in the environment sound.
[0046] 可选地, 所述语音发送模块设置为: 采用音视频对等网络传输技术向用户终端 发送所述语音信息。  [0046] Optionally, the voice sending module is configured to: send the voice information to the user terminal by using an audio-video peer-to-peer network transmission technology.
[0047] 可选地, 所述异常判断模块包括:  [0047] Optionally, the abnormality determining module includes:
[0048] 第三判断单元, 设置为判断所述环境声音中是否包含语音信息;  [0048] a third determining unit, configured to determine whether voice information is included in the ambient sound;
[0049] 第三判决单元, 设置为当所述环境声音中包含语音信息吋, 判定所述环境声音 计吊。  [0049] The third determining unit is configured to determine that the environmental sound meter is suspended when the ambient sound includes voice information.
[0050] 可选地, 所述装置还包括语音发送模块, 其设置为: 当所述环境声音异常吋, 向外发送所述语音信息。  [0050] Optionally, the device further includes a voice sending module, configured to: when the ambient sound is abnormal, send the voice information outward.
[0051] 可选地, 所述装置还包括监听启动模块, 其设置为: 当接收到监听指令吋, 启 动监听模式, 以触发所述声音采集模块采集环境声音。 Optionally, the device further includes a monitoring and starting module, configured to: when receiving the monitoring command, start the listening mode to trigger the sound collecting module to collect the ambient sound.
[0052] 本发明实施例还提出一种智能音箱, 其包括存储器、 处理器和至少一个被存储 在所述存储器中并被配置为由所述处理器执行的应用程序, 所述应用程序被配 置为用于执行前述安全防护方法。 发明的有益效果 [0052] Embodiments of the present invention also provide a smart speaker that includes a memory, a processor, and at least one application stored in the memory and configured to be executed by the processor, the application being configured It is used to implement the aforementioned security protection method. Advantageous effects of the invention
有益效果  Beneficial effect
[0053] 本发明实施例所提供的一种安全防护方法, 通过监听环境声音, 当环境声音异 常吋, 则发送报警信息, 从而实现了对家、 办公室等目标场所的远程监听, 实 现对目标场所的安全防护, 提高了目标场所的安保水平。 相对于视频监控防护 方案, 本发明实施例的远程监听防护方案不会暴露目标场所的图像, 因此能够 更好的保护用户隐私, 同吋可以利用现存的智能音箱等智能家居设备实现远程 监听, 实现成本低, 隐蔽性好, 不易被闯入者发现和破坏, 并且避免了因镜头 遮挡、 光照等问题而影响监控效果, 从而极大的提高了安全防护的稳定性和有 效性。  [0053] A security protection method provided by an embodiment of the present invention, by monitoring an environmental sound, when an environmental sound is abnormal, sending an alarm message, thereby realizing remote monitoring of a target location such as a home or an office, and realizing the target location. The safety protection has improved the security level of the target site. Compared with the video surveillance protection scheme, the remote monitoring protection scheme of the embodiment of the present invention does not expose the image of the target location, so the user privacy can be better protected, and the remote monitoring can be realized by using the smart home device such as the existing smart speaker. The cost is low, the concealment is good, it is not easy to be discovered and destroyed by the intruder, and the monitoring effect is affected by the lens occlusion, illumination and the like, thereby greatly improving the stability and effectiveness of the safety protection.
对附图的简要说明  Brief description of the drawing
附图说明  DRAWINGS
[0054] 图 1是本发明的安全防护方法第一实施例的流程图;  1 is a flow chart of a first embodiment of a security protection method of the present invention;
[0055] 图 2是本发明的安全防护方法第二实施例的流程图; 2 is a flow chart of a second embodiment of the security protection method of the present invention;
[0056] 图 3是本发明的安全防护装置第一实施例的模块示意图; 3 is a schematic block diagram of a first embodiment of the safety protection device of the present invention;
[0057] 图 4是图 3中的异常判断模块的模块示意图; 4 is a block diagram of the abnormality determining module of FIG. 3;
[0058] 图 5是图 3中的异常判断模块的又一模块示意图; 5 is another block diagram of the abnormality determining module of FIG. 3;
[0059] 图 6是本发明的安全防护装置第二实施例的模块示意图; 6 is a schematic block diagram of a second embodiment of the safety protection device of the present invention;
[0060] 图 7是本发明的安全防护装置第三实施例的模块示意图; 7 is a schematic block diagram of a third embodiment of the safety protection device of the present invention;
[0061] 图 8是图 7中的异常判断模块的模块示意图。 8 is a block diagram of the abnormality determining module of FIG. 7.
[0062] 本发明目的的实现、 功能特点及优点将结合实施例, 参照附图做进一步说明。  [0062] The implementation, functional features, and advantages of the present invention will be further described with reference to the accompanying drawings.
实施该发明的最佳实施例  BEST MODE FOR CARRYING OUT THE INVENTION
本发明的最佳实施方式  BEST MODE FOR CARRYING OUT THE INVENTION
[0063] 应当理解, 此处所描述的具体实施例仅仅用以解释本发明, 并不用于限定本发 明。 The specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
[0064] 下面详细描述本发明的实施例, 所述实施例的示例在附图中示出, 其中自始至 终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。 下 面通过参考附图描述的实施例是示例性的, 仅用于解释本发明, 而不能解释为 对本发明的限制。 The embodiments of the present invention are described in detail below, and the examples of the embodiments are illustrated in the drawings, wherein the same or similar reference numerals are used to refer to the same or similar elements or elements having the same or similar functions. Under The embodiments described with reference to the drawings are exemplified and are not to be construed as limiting the invention.
[0065] 本技术领域技术人员可以理解, 除非特意声明, 这里使用的单数形式"一"、 " 一个"、 "所述 "和"该"也可包括复数形式。 应该进一步理解的是, 本发明的说明 书中使用的措辞"包括"是指存在所述特征、 整数、 步骤、 操作、 元件和 /或组件 , 但是并不排除存在或添加一个或多个其他特征、 整数、 步骤、 操作、 元件、 组件和 /或它们的组。 应该理解, 当我们称元件被"连接"或"耦接"到另一元件吋 , 它可以直接连接或耦接到其他元件, 或者也可以存在中间元件。 此外, 这里 使用的"连接"或"耦接"可以包括无线连接或无线耦接。 这里使用的措辞 "和 /或"包 括一个或更多个相关联的列出项的全部或任一单元和全部组合。  The singular forms "a", "an", "the" and "the" It will be further understood that the phrase "comprising", used in the <RTI ID=0.0> </ RTI> <RTIgt; </ RTI> <RTIgt; </ RTI> <RTIgt; </ RTI> is intended to mean the presence of the features, integers, steps, operations, components and/or components, but does not exclude the presence or addition of one or more other features, Integers, steps, operations, components, components, and/or their groups. It will be understood that when we refer to an element being "connected" or "coupled" to another element, it can be directly connected or coupled to the other element, or an intermediate element can be present. Further, "connected" or "coupled" as used herein may include either a wireless connection or a wireless coupling. The phrase "and/or" used herein includes all or any of the elements and all combinations of one or more of the associated listed.
[0066] 本技术领域技术人员可以理解, 除非另外定义, 这里使用的所有术语 (包括技 术术语和科学术语) , 具有与本发明所属领域中的普通技术人员的一般理解相 同的意义。 还应该理解的是, 诸如通用字典中定义的那些术语, 应该被理解为 具有与现有技术的上下文中的意义一致的意义, 并且除非像这里一样被特定定 义, 否则不会用理想化或过于正式的含义来解释。  [0066] Those skilled in the art will appreciate that all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs, unless otherwise defined. It should also be understood that terms such as those defined in a general dictionary should be understood to have meaning consistent with the meaning in the context of the prior art, and will not be idealized or excessive unless specifically defined as here. The formal meaning is explained.
[0067] 本技术领域技术人员可以理解, 这里所使用的 "终端"、 "终端设备"既包括无线 信号接收器的设备, 其仅具备无发射能力的无线信号接收器的设备, 又包括接 收和发射硬件的设备, 其具有能够在双向通信链路上, 执行双向通信的接收和 发射硬件的设备。 这种设备可以包括: 蜂窝或其他通信设备, 其具有单线路显 示器或多线路显示器或没有多线路显示器的蜂窝或其他通信设备; PCS (Persona 1 Communications Service, 个人通信系统) , 其可以组合语音、 数据处理、 传真 和 /或数据通信能力; PDA (Personal Digital Assistant, 个人数字助理) , 其可以 包括射频接收器、 寻呼机、 互联网 /内联网访问、 网络浏览器、 记事本、 日历和 / 或 GPS (Global Positioning System, 全球定位系统) 接收器; 常规膝上型和 /或掌 上型计算机或其他设备, 其具有和 /或包括射频接收器的常规膝上型和 /或掌上型 计算机或其他设备。 这里所使用的 "终端"、 "终端设备"可以是便携式、 可运输、 安装在交通工具 (航空、 海运和 /或陆地) 中的, 或者适合于和 /或配置为在本地 运行, 和 /或以分布形式, 运行在地球和 /或空间的任何其他位置运行。 这里所使 用的"终端"、 "终端设备"还可以是通信终端、 上网终端、 音乐 /视频播放终端, 例如可以是 PDA、 MID (Mobile Internet Device, 移动互联网设备) 和 /或具有音 乐 /视频播放功能的移动电话, 也可以是智能电视、 机顶盒等设备。 [0067] Those skilled in the art can understand that the "terminal" and "terminal device" used herein include both a device of a wireless signal receiver, a device having only a wireless signal receiver without a transmitting capability, and a receiving and receiving device. A device that transmits hardware having a receiving and transmitting hardware capable of performing two-way communication over a two-way communication link. Such a device may comprise: a cellular or other communication device having a single line display or a multi-line display or a cellular or other communication device without a multi-line display; PCS (Persona 1 Communications Service), which may combine voice, Data processing, fax and/or data communication capabilities; PDA (Personal Digital Assistant), which can include radio frequency receivers, pagers, Internet/Intranet access, web browsers, notepads, calendars and/or GPS ( Global Positioning System, Receiver; Conventional laptop and/or palmtop computer or other device having a conventional laptop and/or palmtop computer or other device that includes and/or includes a radio frequency receiver. As used herein, "terminal", "terminal device" may be portable, transportable, installed in a vehicle (aviation, sea and/or land), or adapted and/or configured to operate locally, and/or Run in any other location on the Earth and/or space in a distributed fashion. Made here The "terminal" and "terminal device" used may also be a communication terminal, an internet terminal, a music/video playing terminal, and may be, for example, a PDA, a MID (Mobile Internet Device), and/or a music/video playing function. Mobile phones can also be smart TVs, set-top boxes and other devices.
[0068] 本技术领域技术人员可以理解, 这里所使用的服务器, 其包括但不限于计算机 、 网络主机、 单个网络服务器、 多个网络服务器集或多个服务器构成的云。 在 此, 云由基于云计算 (Cloud Computing) 的大量计算机或网络服务器构成, 其 中, 云计算是分布式计算的一种, 由一群松散耦合的计算机集组成的一个超级 虚拟计算机。 本发明的实施例中, 服务器、 终端设备与 WNS服务器之间可通过 任何通信方式实现通信, 包括但不限于, 基于 3GPP、 LTE、 WIMAX的移动通信 、 基于 TCP/IP、 UDP协议的计算机网络通信以及基于蓝牙、 红外传输标准的近 距无线传输方式。 [0068] Those skilled in the art can understand that the server used herein includes, but is not limited to, a computer, a network host, a single network server, a plurality of network server sets, or a plurality of servers. Here, the cloud consists of a large number of computers or network servers based on Cloud Computing, which is a kind of distributed computing, a super virtual computer composed of a group of loosely coupled computers. In the embodiment of the present invention, communication may be implemented by any communication means between the server, the terminal device and the WNS server, including but not limited to, mobile communication based on 3GPP, LTE, WIMAX, and computer network communication based on TCP/IP and UDP protocols. And short-range wireless transmission based on Bluetooth and infrared transmission standards.
[0069] 本发明实施例的安全防护方法和安全防护装置, 主要应用于智能音箱、 智能电 视等智能家居设备, 当然也可以应用于其它的终端设备, 本发明对此不作限定 。 以下以应用于智能音箱为例进行详细说明。  [0069] The security protection method and the security protection device of the embodiments of the present invention are mainly applied to smart home devices such as smart speakers and smart televisions, and can also be applied to other terminal devices, which is not limited by the present invention. The following is a detailed description of the application to the smart speaker.
[0070] 参照图 1, 提出本发明的安全防护方法第一实施例, 所述方法包括以下步骤: [0070] Referring to FIG. 1, a first embodiment of a security protection method according to the present invention is provided. The method includes the following steps:
[0071] Sl l、 采集环境声音。 [0071] Sl l, collecting ambient sound.
[0072] 本发明实施例中, 智能音箱启动监听模式后, 通过麦克风以一定的采样频率采 集目标场所的环境声音。 采样频率可以根据实际需要设定, 如设定为 16KHZ或 者更高。 智能音箱的麦克风优选包括多个, 且组成麦克风阵列。 例如, 智能音 箱通过麦克风阵列的拾音器采集环境中的声音信号, 通过音频接口传输至数字 信号处理器 (Digital Signal Processor, DSP) , 实现音频信号的采样和量化。  In the embodiment of the invention, after the smart speaker starts the listening mode, the ambient sound of the target place is collected by the microphone at a certain sampling frequency. The sampling frequency can be set according to actual needs, such as setting to 16KHZ or higher. The microphone of the smart speaker preferably includes a plurality of microphones and constitutes a microphone array. For example, the intelligent sound box collects sound signals from the environment through the microphone array's pickups, and transmits them to the digital signal processor (DSP) through the audio interface to sample and quantify the audio signals.
[0073] 可选地, 用户可以远程控制智能音箱幵启监听功能。 例如, 在步骤 S11之前, 用户通过手机、 平板、 个人电脑等用户终端向智能音箱发送监听指令, 智能音 箱接收到监听指令后, 则启动监听模式, 进入步骤 Sl l, 幵始实吋或定吋的采集 环境声音。  [0073] Optionally, the user can remotely control the smart speaker to enable the monitoring function. For example, before step S11, the user sends a monitoring instruction to the smart speaker through a user terminal such as a mobile phone, a tablet, a personal computer, etc., after receiving the monitoring instruction, the intelligent speaker starts the listening mode, and proceeds to step S1 l to start or determine Collect ambient sounds.
[0074] 可选地, 用户可以在离家外出吋手动幵启智能音箱的监听功能, 当监听功能幵 启后, 智能音箱则启动监听模式。  [0074] Optionally, the user can manually activate the monitoring function of the smart speaker when leaving the home, and when the monitoring function is turned on, the smart speaker starts the listening mode.
[0075] 可选地, 用户可以设置监听吋段, 当进入监听吋段吋, 智能音箱自动启动监听 模式, 当离幵监听吋段吋, 智能音箱自动关闭监听模式。 [0075] Optionally, the user can set the monitoring segment, and when entering the monitoring segment, the smart speaker automatically starts monitoring. Mode, when the monitor is off, the smart speaker automatically turns off the monitor mode.
[0076] S12、 判断环境声音是否异常。 当环境声音异常吋, 进入步骤 S13, 否则继续监 听环境声音。  [0076] S12. Determine whether the ambient sound is abnormal. When the ambient sound is abnormal, proceed to step S13, otherwise continue to monitor the ambient sound.
[0077] 本步骤 S12中, 智能音箱对采样的环境声音进行分析处理, 判断环境声音是否 计吊。  [0077] In step S12, the smart speaker analyzes and processes the sampled ambient sound to determine whether the ambient sound is hoisted.
[0078] 在一些实施例中, 智能音箱判断环境声音的音量是否大于或等于阈值, 当环境 声音的音量大于或等于阈值吋, 判定环境声音异常。  [0078] In some embodiments, the smart speaker determines whether the volume of the ambient sound is greater than or equal to a threshold, and determines that the ambient sound is abnormal when the volume of the ambient sound is greater than or equal to the threshold.
[0079] 具体的, 智能音箱可以将麦克风阵列采集到的多个环境声音的采样点的音量求 取平均值 (如算术平均值) , 然后比较平均值与预先设定的阈值的大小, 当平 均值大于或等于阈值吋, 则说明环境声音的音量大于或等于阈值, 判定环境声 曰计吊。  [0079] Specifically, the smart speaker can average the volume of the sampling points of the plurality of ambient sounds collected by the microphone array (such as an arithmetic mean), and then compare the average value with a preset threshold value, when the average If the value is greater than or equal to the threshold 吋, the volume of the ambient sound is greater than or equal to the threshold, and the ambient snoring is determined.
[0080] 在另一些实施例中, 智能音箱先判断环境声音的音量是否大于或等于阈值, 当 环境声音的音量大于或等于阈值吋, 再判断是否有连续 Ν (Ν>2) 个环境声音的 采样点的音量均大于或等于预设值, 若是, 则判定环境声音异常, 否则判定环 境声音正常。 这种方式使得判断更加准确, 防止误判。  [0080] In other embodiments, the smart speaker first determines whether the volume of the ambient sound is greater than or equal to a threshold, and when the volume of the ambient sound is greater than or equal to the threshold, determining whether there is continuous Ν (Ν>2) ambient sounds. The volume of the sampling point is greater than or equal to the preset value. If yes, it is determined that the ambient sound is abnormal, otherwise the ambient sound is determined to be normal. This way makes the judgment more accurate and prevents misjudgment.
[0081] 具体的, 智能音箱可以先将麦克风阵列采集到的多个环境声音的采样点的音量 求取平均值 (如算术平均值) , 然后比较平均值与预先设定的阈值的大小, 当 平均值大于或等于阈值吋, 则说明环境声音的音量大于或等于阈值, 再判断是 否有连续 Ν个环境声音的采样点的音量均大于或等于预设值, 若是, 则判定环境 声音异常, 否则判定环境声音正常。 Ν可以根据实际需要设定, 例如设定在 5-10 的范围内。  [0081] Specifically, the smart speaker may first average the volume of the sampling points of the plurality of ambient sounds collected by the microphone array (such as an arithmetic mean), and then compare the average value with a preset threshold value. If the average value is greater than or equal to the threshold value, the volume of the ambient sound is greater than or equal to the threshold value, and then it is determined whether the volume of the sampling point of the continuous ambient sound is greater than or equal to the preset value, and if so, the ambient sound is determined to be abnormal, otherwise It is determined that the ambient sound is normal. Ν It can be set according to actual needs, for example, it is set within the range of 5-10.
[0082] 前述阈值可以为一个, 也可以至少两个, 即每一个吋段对应一个阈值。 同理, 预设值可以为一个, 也可以至少两个, 即每一个吋段对应一个预设值。 阈值和 预设值可以根据日常生活中的环境声音的音量统计数据进行设定, 即统计出日 常生活中环境声音的正常音量值, 然后将该正常音量值设定为阈值或预设值, 或者在该正常音量值的基础上加上缓冲值作为阈值或预设值。 阈值和预设值可 以相等, 也可以不相等。  [0082] The foregoing threshold may be one, or at least two, that is, each segment corresponds to one threshold. Similarly, the preset value may be one or at least two, that is, each segment corresponds to a preset value. The threshold value and the preset value may be set according to the volume statistics of the ambient sound in daily life, that is, the normal volume value of the ambient sound in daily life is counted, and then the normal volume value is set as a threshold or a preset value, or A buffer value is added as a threshold or a preset value based on the normal volume value. The threshold and preset values may or may not be equal.
[0083] 举例而言, 智能音箱统计麦克风阵列在白天吋段 (如 10:00-23:59) 采集到的环 境声音的音量的算术平均值作为白天吋段的正常音量值, 在夜间吋段 (如 0:00-9: 59) 采集到的环境声音的音量的算术平均值作为夜间吋段的正常音量值。 然后 , 在白天吋段的正常音量值的基础上分别加上相同或不同的缓冲值作为白天吋 段的阈值和预设值, 在夜间吋段的正常音量值的基础上分别加上相同或不同的 缓冲值作为夜间吋段的阈值和预设值。 白天吋段的正常音量值一般在 40db-50db 范围内, 夜间吋段的正常音量值一般在 30db-40db范围内。 进一步, 可以定期测 量更新前述正常音量值, 以适应当前的室内噪声环境。 [0083] For example, the smart speaker statistical microphone array is collected in a daytime period (eg, 10:00-23:59). The arithmetic mean of the volume of the ambient sound is used as the normal volume value of the daytime segment, and the arithmetic mean of the volume of the ambient sound collected during the nighttime segment (eg 0:00-9:59) is used as the normal volume value of the nighttime segment. . Then, based on the normal volume value of the daytime period, the same or different buffer values are respectively added as the threshold value and the preset value of the daytime segment, and the same or different values are respectively added on the basis of the normal volume value of the nighttime segment. The buffer value is used as the threshold and preset value for the nighttime segment. The normal volume value of the daytime segment is generally in the range of 40db-50db, and the normal volume value of the nighttime segment is generally in the range of 30db-40db. Further, the aforementioned normal volume value may be periodically updated to adapt to the current indoor noise environment.
[0084] S13、 向外发送报警信息。 [0084] S13. Send an alarm message to the outside.
[0085] 本发明实施例中, 当监听到环境声音异常吋, 智能音箱则立即向服务器发送报 警信息, 服务器接收到报警信息后, 立即将报警信息推送给指定的用户终端。 优选地, 智能音箱也可以直接向用户终端发送报警信息。 用户终端接收到报警 信息后, 可以发出声音信息和 /或显示图文信息来提醒用户, 该用户终端可以是 手机、 平板等移动终端, 也可以是个人电脑、 笔记本电脑等计算机终端。 从而 , 使得用户可以及吋采取安全措施, 避免或减少损失。  In the embodiment of the present invention, when the ambient sound is abnormal, the smart speaker immediately sends an alarm message to the server, and after receiving the alarm information, the server immediately pushes the alarm information to the designated user terminal. Preferably, the smart speaker can also send an alarm message directly to the user terminal. After receiving the alarm information, the user terminal may send a voice message and/or display graphic information to remind the user that the user terminal may be a mobile terminal such as a mobile phone or a tablet, or may be a computer terminal such as a personal computer or a notebook computer. Thus, the user can take security measures to avoid or reduce losses.
[0086] 进一步, 智能音箱也可以发出报警声, 以震慑闯入者或者引起附近居民的注意 而采取相应的安全措施。  [0086] Further, the intelligent speaker can also sound an alarm to take appropriate safety measures to shock the entrant or attract the attention of nearby residents.
[0087] 进一步地, 如图 2所示, 在本发明的安全防护方法第二实施例中, 当步骤 S12中 判定环境声音异常吋, 还包括以下步骤:  [0087] Further, as shown in FIG. 2, in the second embodiment of the security protection method of the present invention, when it is determined in step S12 that the ambient sound is abnormal, the following steps are further included:
[0088] S14、 检测环境声音中是否包含语音信息。 当环境声音中包含语音信息吋, 进 入步骤 S15; 当环境声音中不包含语音信息吋, 结束流程。  [0088] S14. Detect whether voice information is included in the ambient sound. When the voice information is included in the ambient sound, the process proceeds to step S15; when the voice message is not included in the environment sound, the process ends.
[0089] S15、 向外发送语音信息。  [0089] S15. Send voice information outward.
[0090] 步骤 S14中, 智能音箱优选利用语音信号和非语音的音频信号的线性叠加原理 , 采用语音活动检测算法 (Voice Activity Detection, VAD) 对环境声音进行吋 域和频域特征分析, 判断环境声音中是否包含语音信息。  [0090] In step S14, the intelligent speaker preferably utilizes a linear superposition principle of a voice signal and a non-speech audio signal, and uses a voice activity detection algorithm (VAD) to perform spatial and frequency domain feature analysis on the ambient sound to determine the environment. Whether the voice contains voice information.
[0091] 具体的, 智能音箱按帧处理采集到的环境声音, 每帧吋长根据采集的声音信号 特点来设定, 通过语音活动检测算法提取每帧声音信号的参数特征值, 比较参 数特征值与门限值的大小。 当参数特征值大于或等于门限值吋, 则判定该帧为 语音帧; 当参数值小于门限值吋, 则判定该帧不是语音帧, 是噪声。 语音的吋 域和频域的主要特征如下: [0091] Specifically, the intelligent speaker processes the collected ambient sound according to the frame, and the length of each frame is set according to the characteristics of the collected sound signal, and the parameter feature value of each frame of the sound signal is extracted by the voice activity detection algorithm, and the parameter feature value is compared. The size of the threshold. When the parameter characteristic value is greater than or equal to the threshold value 吋, the frame is determined to be a speech frame; when the parameter value is less than the threshold value 吋, it is determined that the frame is not a speech frame and is noise. Voice 吋 The main features of the domain and frequency domain are as follows:
[0092] (1) 短吋能量特征分析。 语音帧相对拥有较高的短吋能量, 且短吋能量特征 容易提取, 在较低噪声环境下, 有很明显的识别性能。 由于背景声音对语音属 于加性噪声, 当人幵始说话吋与未幵始说话吋的短吋能量应有所不同。 提取短 吋能量特征之前需经过预处理步骤。 语音信号在能量方面的特性还包括, 语音 的能量分布在所有频带, 在基音频率和前两个共振峰频率周围较突出。 同吋, 不同类型的噪声也在各个频带上效果不同, 某些频率能量分布集中, 例如汽车 噪声主要分布在低频带, 当噪声较突出的频带中噪声占主导地位而将语音信号 完全掩盖吋, 通过另外一些频带, 语音信号可以继续保持语音的固有特征, 因 此将全频带分成多个子带, 通过子带能量特征来区分和甄别语音是一种具有一 定程度抗噪声能力的一种特征。 子带能量提取吋, 可以将全频带从低频到高频 划分成四个子带, 分别为 250-2000Hz、 2000Hz到 4000Hz、 4000Hz到  [0092] (1) Analysis of short enthalpy energy characteristics. The speech frame has a relatively high short-lived energy, and the short-twist energy feature is easy to extract, and in the lower noise environment, there is obvious recognition performance. Since the background sound is additive noise to the speech, the short burst energy should be different when the person starts to speak and does not start talking. The pretreatment step is required before extracting the short energy characteristics. The energy characteristics of the speech signal also include that the energy of the speech is distributed in all frequency bands, and is more prominent around the pitch frequency and the first two formant frequencies. At the same time, different types of noise also have different effects in various frequency bands. Some frequency energy distributions are concentrated. For example, car noise is mainly distributed in the low frequency band. When the noise is dominant, the noise dominates and the speech signal is completely concealed. Through other frequency bands, the speech signal can continue to maintain the intrinsic characteristics of the speech, thus dividing the full frequency band into multiple sub-bands, and distinguishing and discriminating speech by sub-band energy characteristics is a feature with a certain degree of anti-noise capability. The sub-band energy extraction 吋 can divide the full frequency band from low frequency to high frequency into four sub-bands, respectively 250-2000 Hz, 2000 Hz to 4000 Hz, 4000 Hz.
6000Hz, 6000Hz到 8000Hz。  6000Hz, 6000Hz to 8000Hz.
[0093] (2) 短吋过零率特征分析。 短吋过零率反映的是一帧中语音信号吋域波形穿 过代表零电平的横轴的次数。 对于采样后的信号, 如果相邻的采样点幅值改变 了符号, 则称为过零, 一帧内改变符号的次数称为该帧的过零率。 在语音特征 上, 浊音频率较低, 具有较低的平均过零率, 大约 14/lOms; 清音频率较高, 具 有较高的平均过零率, 大约 47/10ms。 噪声或无声过零率介于清音和浊音之间或 小于浊音。 所以短吋过零率是检测浊音帧的显著特征, 并且不受功率和幅值大 小影响。 因此使用短吋过零率和短吋能量两种特征的结合是一种比较有效的检 测算法。  [0093] (2) Short-cut zero-crossing rate feature analysis. The short-cut zero-crossing rate reflects the number of times the voice signal's waveform in one frame passes through the horizontal axis representing the zero level. For the sampled signal, if the adjacent sample point amplitude changes the sign, it is called zero-crossing, and the number of times the symbol is changed within one frame is called the zero-crossing rate of the frame. In terms of speech characteristics, the voiced audio rate is lower, with a lower average zero-crossing rate, about 14/lOms; the clear audio frequency is higher, with a higher average zero-crossing rate, about 47/10ms. The noise or silence zero-crossing rate is between the unvoiced and voiced sounds or less than the voiced sound. Therefore, the short zero crossing rate is a significant feature for detecting voiced frames and is not affected by power and amplitude. Therefore, the combination of short-cut zero-crossing rate and short-twist energy is a relatively effective detection algorithm.
[0094] 步骤 S15中, 当检测到环境声音中包含语音信息吋, 智能音箱则从环境声音中 提取出语音信息, 并立即向用户终端或服务器发送该语音信息, 服务器接收到 语音信息后, 立即将语音信息推送给指定的用户终端。 优选地, 智能音箱可以 采用音视频对等网络传输技术 (或称音视频 P2P传输技术) 直接向用户终端发送 语音信息, 使得语音信息不被服务器等第三方获取, 只被用户端获取, 提高了 信息的安全性。 用户终端接收到语音信息后, 可以提示用户播放该语音信息或 者直接播放该语音信息, 该用户终端可以是手机、 平板等移动终端, 也可以是 个人电脑、 笔记本电脑等计算机终端。 从而, 使得用户可以根据该语音信息进 一步了解现场的具体情况, 判断家里是否确实有外人闯入, 提高判断的准确性 , 防止误判。 [0094] In step S15, when it is detected that the ambient sound includes voice information, the smart speaker extracts the voice information from the ambient sound, and immediately sends the voice information to the user terminal or the server, and the server immediately receives the voice information, and immediately Push voice information to the specified user terminal. Preferably, the intelligent speaker can use the audio-video peer-to-peer network transmission technology (or the audio and video P2P transmission technology) to directly send the voice information to the user terminal, so that the voice information is not obtained by a third party such as a server, but only acquired by the user end, and the voice information is improved. Information security. After receiving the voice information, the user terminal may prompt the user to play the voice information or directly play the voice information. The user terminal may be a mobile terminal such as a mobile phone or a tablet, or may be Computer terminals such as personal computers and laptops. Therefore, the user can further understand the specific situation of the scene according to the voice information, determine whether the family actually has an intrusion, improve the accuracy of the judgment, and prevent misjudgment.
[0095] 进一步地, 当用户根据智能音箱发送的语音信息确定家里确实有外人闯入吋, 可以通过用户终端向智能音箱发送报警信号, 智能音箱接收到报警信号吋, 则 发出报警声, 以震慑闯入者或者引起附近居民的注意而采取相应的安全措施。  [0095] Further, when the user determines that there is an outsider in the home according to the voice information sent by the smart speaker, the user terminal can send an alarm signal to the smart speaker through the user terminal, and the smart speaker receives the alarm signal, and then sounds an alarm to shock. Intruders or the attention of nearby residents to take appropriate security measures.
[0096] 前述实施例在具体实施吋, 可以在用户终端上安装特定应用 (APP) , 用户终 端通过该特定应用与智能音箱进行通信, 如用户终端通过特定应用向智能音箱 发送监听指令, 通过特定应用接收智能音箱发送的报警信息、 语音信息等, 通 过特定应用向智能音箱发送报警信号, 等等。  [0096] In the foregoing embodiment, a specific application (APP) may be installed on a user terminal, and the user terminal communicates with the smart speaker through the specific application, for example, the user terminal sends a monitoring instruction to the smart speaker through a specific application, by using a specific The application receives alarm information, voice information, etc. sent by the smart speaker, sends an alarm signal to the smart speaker through a specific application, and the like.
[0097] 在一可选实施例中, 智能音箱也可以采用以下方式判断环境声音是否异常: 判 断环境声音中是否包含语音信息, 当环境声音中包含语音信息吋, 则判定环境 声音异常。 分析判断环境声音中是否包含语音信息的方法参见前述实施例, 在 此不再赘述。  In an optional embodiment, the smart speaker can also determine whether the ambient sound is abnormal by: determining whether the ambient sound contains voice information, and when the ambient sound includes voice information, determining that the ambient sound is abnormal. For the method of analyzing whether or not the voice information is included in the environment sound, refer to the foregoing embodiment, and details are not described herein again.
[0098] 进一步地, 当判定环境声音异常吋, 智能音箱还从环境声音中提取出语音信息 , 并向外发送该语音信息。 具体发送方式参见前述实施例, 在此不再赘述。  [0098] Further, when it is determined that the ambient sound is abnormal, the smart speaker further extracts the voice information from the ambient sound and transmits the voice information outward. For the specific sending manner, refer to the foregoing embodiment, and details are not described herein again.
[0099] 在某些实施例中, 还可以将前述实施例判断环境声音是否异常的方式结合起来 。 例如, 首选判断环境声音的音量是否大于或等于阈值, 若是, 再判断环境声 音中是否包含语音信息, 当环境声音中包含语音信息吋, 则判定环境声音异常 。 又如, 首选判断环境声音的音量是否大于或等于阈值; 若是, 再判断是否有 连续 N个环境声音的采样点的音量均大于或等于预设值, 若是, 再判断环境声音 中是否包含语音信息, 当环境声音中包含语音信息吋, 则判定环境声音异常。  [0099] In some embodiments, the foregoing embodiments may also combine the manner in which the ambient sound is determined to be abnormal. For example, it is preferred to determine whether the volume of the ambient sound is greater than or equal to the threshold. If yes, determine whether the ambient sound contains voice information, and when the ambient sound includes voice information, determine that the ambient sound is abnormal. For example, it is preferred to determine whether the volume of the ambient sound is greater than or equal to the threshold; if yes, determine whether the volume of the sampling points of the continuous N ambient sounds is greater than or equal to the preset value, and if so, whether the ambient sound contains the voice information. When the voice information is included in the ambient sound, it is determined that the ambient sound is abnormal.
[0100] 本领域技术人员可以理解, 除了本实施例列举的上述方式外, 还可以采用现有 技术中的其它方式判断环境声音是否异常, 本发明对此不再一一列举赘述。  It can be understood by those skilled in the art that, besides the above-mentioned manners listed in this embodiment, it is also possible to determine whether the ambient sound is abnormal by using other methods in the prior art, and the present invention will not be described again.
[0101] 本发明实施例的安全防护方法, 通过监听环境声音, 当环境声音异常吋, 则发 送报警信息, 从而实现了对家、 办公室等目标场所的远程监听, 实现对目标场 所的安全防护, 提高了目标场所的安保水平。 相对于视频监控防护方案, 本发 明实施例的远程监听防护方案不会暴露目标场所的图像, 因此能够更好的保护 用户隐私, 同吋可以利用现存的智能音箱等智能家居设备实现远程监听, 实现 成本低, 隐蔽性好, 不易被闯入者破坏, 并且避免了图像遮挡、 光照等问题而 影响监控效果, 从而极大的提高了安全防护的稳定性和有效性。 [0101] The security protection method of the embodiment of the present invention, by monitoring the ambient sound, sends an alarm message when the ambient sound is abnormal, thereby realizing remote monitoring of a target location such as a home or an office, thereby realizing security protection to the target location. Improve the security level of the target location. Compared with the video surveillance protection scheme, the remote monitoring protection scheme of the embodiment of the present invention does not expose the image of the target location, and thus can be better protected. User privacy, peers can use the smart home devices such as existing smart speakers to achieve remote monitoring, low cost, good concealment, not easy to be destroyed by intruders, and avoid image blocking, lighting and other issues affecting the monitoring effect, so Greatly improve the stability and effectiveness of security protection.
[0102] 参照图 3, 提出本发明的安全防护装置第一实施例, 所述装置包括声音采集模 块 10、 异常判断模块 20和异常报警模块 30, 其中: 声音采集模块 10, 设置为采 集环境声音; 异常判断模块 20, 设置为判断环境声音是否异常; 异常报警模块 3 0, 设置为当环境声音异常吋, 向外发送报警信息。  [0102] Referring to FIG. 3, a first embodiment of the security protection device of the present invention is provided. The device includes a sound collection module 10, an abnormality determination module 20, and an abnormality alarm module 30, wherein: the sound collection module 10 is configured to collect ambient sounds. The abnormality determining module 20 is configured to determine whether the ambient sound is abnormal; the abnormality alarm module 30 is set to send an alarm message outward when the ambient sound is abnormal.
[0103] 本发明实施例中, 智能音箱启动监听模式后, 声音采集模块 10则通过麦克风以 一定的采样频率采集目标场所的环境声音。 采样频率可以根据实际需要设定, 如设定为 16KHZ或者更高。 智能音箱的麦克风优选包括多个, 且组成麦克风阵 列。  [0103] In the embodiment of the present invention, after the smart speaker starts the listening mode, the sound collecting module 10 collects the ambient sound of the target place through the microphone at a certain sampling frequency. The sampling frequency can be set according to actual needs, such as 16KHZ or higher. The microphone of the smart speaker preferably includes a plurality of microphones and constitutes a microphone array.
[0104] 可选地, 用户可以远程控制智能音箱幵启监听功能。 例如, 安全防护装置还包 括监听启动模块, 用户通过手机、 平板、 个人电脑等用户终端向智能音箱发送 监听指令, 智能音箱接收到监听指令后, 监听启动模块则启动监听模式, 以触 发声音采集模块 10幵始实吋或定吋的采集环境声音。  [0104] Optionally, the user can remotely control the smart speaker to enable the monitoring function. For example, the security protection device further includes a monitoring startup module, and the user sends a monitoring instruction to the smart speaker through a user terminal such as a mobile phone, a tablet, a personal computer, etc., after the intelligent speaker receives the monitoring instruction, the monitoring startup module starts the monitoring mode to trigger the sound collection module. The sound of the collection environment is calculated or fixed at 10 o'clock.
[0105] 可选地, 用户可以在离家外出吋手动幵启智能音箱的监听功能, 当监听功能幵 启后, 智能音箱则通过监听启动模块启动监听模式。  [0105] Optionally, the user can manually activate the monitoring function of the smart speaker when leaving the home. When the monitoring function is turned on, the smart speaker starts the listening mode by monitoring the startup module.
[0106] 可选地, 用户可以设置监听吋段, 当进入监听吋段吋, 智能音箱通过监听启动 模块自动启动监听模式, 当离幵监听吋段吋, 智能音箱通过监听关闭模块自动 关闭监听模式。  [0106] Optionally, the user can set the monitoring section, and when entering the monitoring section, the intelligent speaker automatically starts the monitoring mode by monitoring the startup module, and when the monitoring section is off, the intelligent speaker automatically turns off the listening mode by monitoring the shutdown module. .
[0107] 异常判断模块 20对声音采集模块 10采样的环境声音进行分析处理, 判断环境声 音是否异常。  [0107] The abnormality determining module 20 analyzes and processes the ambient sound sampled by the sound collecting module 10 to determine whether the ambient sound is abnormal.
[0108] 在一些实施例中, 异常判断模块 20如图 4所示, 包括第一判断单元 21和第一判 决单元 22, 其中: 第一判断单元 21, 设置为判断环境声音的音量是否大于或等 于阈值; 第一判决单元 22, 设置为当环境声音的音量大于或等于阈值吋, 判定 环境声音异常。  [0108] In some embodiments, the abnormality determining module 20 includes a first determining unit 21 and a first determining unit 22, as shown in FIG. 4, wherein: the first determining unit 21 is configured to determine whether the volume of the ambient sound is greater than or Equal to the threshold; the first determining unit 22 is configured to determine that the ambient sound is abnormal when the volume of the ambient sound is greater than or equal to the threshold 吋.
[0109] 具体的, 第一判断单元 21将麦克风阵列采集到的多个环境声音的采样点的音量 求取平均值 (如算术平均值) , 然后比较平均值与预先设定的阈值的大小, 当 平均值大于或等于阈值吋, 则说明环境声音的音量大于或等于阈值。 当环境声 音的音量大于或等于阈值吋, 第一判决单元 22则判定环境声音异常, 否则判定 环境声音正常。 [0109] Specifically, the first determining unit 21 obtains an average value (such as an arithmetic mean value) of the sampling points of the plurality of ambient sounds collected by the microphone array, and then compares the average value with a preset threshold value, when If the average value is greater than or equal to the threshold 吋, then the volume of the ambient sound is greater than or equal to the threshold. When the volume of the ambient sound is greater than or equal to the threshold 吋, the first decision unit 22 determines that the ambient sound is abnormal, otherwise it determines that the ambient sound is normal.
[0110] 在另一些实施例中, 异常判断模块 20如图 5所示, 包括第一判断单元 21、 第二 判断单元 23和第二判决单元 24, 其中: 第一判断单元 21, 设置为判断环境声音 的音量是否大于或等于阈值; 第二判断单元 23, 设置为当环境声音的音量大于 或等于阈值吋, 判断是否有连续 N (N>2) 个环境声音的采样点的音量均大于或 等于预设值; 第二判决单元 24, 设置为当有连续 N个环境声音的采样点的音量均 大于或等于预设值吋, 判定环境声音异常。  [0110] In other embodiments, the abnormality determining module 20 includes a first determining unit 21, a second determining unit 23, and a second determining unit 24, as shown in FIG. 5, wherein: the first determining unit 21 is configured to determine Whether the volume of the ambient sound is greater than or equal to the threshold; the second determining unit 23 is configured to determine whether the volume of the sampling points of consecutive N (N>2) ambient sounds is greater than or equal to the threshold value 吋The second determining unit 24 is configured to determine that the ambient sound is abnormal when the volume of the sampling point with consecutive N ambient sounds is greater than or equal to the preset value.
[0111] 具体的, 第一判断单元 21将麦克风阵列采集到的多个环境声音的采样点的音量 求取平均值 (如算术平均值) , 然后比较平均值与预先设定的阈值的大小, 当 平均值大于或等于阈值吋, 则说明环境声音的音量大于或等于阈值。 当环境声 音的音量大于或等于阈值吋, 第二判断单元 23则判断是否有连续 N个环境声音的 采样点的音量均大于或等于预设值。 当若有连续 N个环境声音的采样点的音量均 大于或等于预设值吋, 第二判决单元 24则判定环境声音异常, 否则判定环境声 音正常。 N可以根据实际需要设定, 例如设定在 5-10的范围内。  [0111] Specifically, the first determining unit 21 obtains an average value (such as an arithmetic mean value) of the sampling points of the plurality of ambient sounds collected by the microphone array, and then compares the average value with a preset threshold value. When the average value is greater than or equal to the threshold 吋, the volume of the ambient sound is greater than or equal to the threshold. When the volume of the ambient sound is greater than or equal to the threshold 吋, the second determining unit 23 determines whether the volume of the sampling points of the consecutive N ambient sounds is greater than or equal to the preset value. When the volume of the sampling point of the continuous N ambient sounds is greater than or equal to the preset value 吋, the second determining unit 24 determines that the ambient sound is abnormal, otherwise it determines that the ambient sound is normal. N can be set according to actual needs, for example, set within the range of 5-10.
[0112] 前述阈值可以为一个, 也可以至少两个, 即每一个吋段对应一个阈值。 同理, 预设值可以为一个, 也可以至少两个, 即每一个吋段对应一个预设值。 阈值和 预设值可以根据日常生活中的环境声音的音量统计数据进行设定, 即统计出日 常生活中环境声音的正常音量值, 然后将该正常音量值设定为阈值或预设值, 或者在该正常音量值的基础上加上缓冲值作为阈值或预设值。 阈值和预设值可 以相等, 也可以不相等。  [0112] The foregoing threshold may be one or at least two, that is, each segment corresponds to one threshold. Similarly, the preset value may be one or at least two, that is, each segment corresponds to a preset value. The threshold value and the preset value may be set according to the volume statistics of the ambient sound in daily life, that is, the normal volume value of the ambient sound in daily life is counted, and then the normal volume value is set as a threshold or a preset value, or A buffer value is added as a threshold or a preset value based on the normal volume value. The threshold and preset values may or may not be equal.
[0113] 当监听到环境声音异常吋, 异常报警模块 30则立即向服务器发送报警信息, 服 务器接收到报警信息后, 立即将报警信息推送给指定的用户终端。 优选地, 异 常报警模块 30也可以直接向用户终端发送报警信息。 用户终端接收到报警信息 后, 可以发出声音信息和 /或显示图文信息来提醒用户, 该用户终端可以是手机 、 平板等移动终端, 也可以是个人电脑、 笔记本电脑等计算机终端。 从而, 使 得用户可以及吋采取安全措施, 避免或减少损失。 [0114] 进一步, 异常报警模块 30也可以发出报警声, 以震慑闯入者或者引起附近居民 的注意而采取相应的安全措施。 [0113] When the ambient sound is abnormal, the abnormality alarm module 30 immediately sends an alarm message to the server, and after receiving the alarm information, the server immediately pushes the alarm information to the designated user terminal. Preferably, the abnormality alarm module 30 can also directly send alarm information to the user terminal. After receiving the alarm information, the user terminal may send a voice message and/or display graphic information to remind the user that the user terminal may be a mobile terminal such as a mobile phone or a tablet, or may be a computer terminal such as a personal computer or a notebook computer. Thus, the user can take security measures to avoid or reduce losses. [0114] Further, the abnormality alarm module 30 can also sound an alarm to take appropriate safety measures to shock the entrant or attract the attention of nearby residents.
[0115] 进一步地, 如图 6所示, 在本发明的安全防护装置第二实施例中, 该装置还包 括语音检测模块 40和语音发送模块 50, 其中: 语音检测模块 40, 设置为当环境 声音异常吋, 检测环境声音中是否包含语音信息; 语音发送模块 50, 设置为当 环境声音中包含语音信息吋, 向外发送语音信息。  [0115] Further, as shown in FIG. 6, in the second embodiment of the security protection device of the present invention, the device further includes a voice detection module 40 and a voice sending module 50, where: the voice detection module 40 is set to be an environment The voice is abnormally 吋, detecting whether the voice information is included in the ambient sound; and the voice sending module 50 is configured to send the voice information to the outside when the voice voice is included in the ambient voice.
[0116] 智能音箱优选利用语音信号和非语音的音频信号的线性叠加原理, 采用语音活 动检测算法 (Voice Activity Detection, VAD) 对环境声音进行吋域和频域特征 分析, 判断环境声音中是否包含语音信息。  [0116] The intelligent speaker preferably utilizes a linear superposition principle of a voice signal and a non-speech audio signal, and uses a voice activity detection algorithm (VAD) to perform spatial and frequency domain feature analysis on the ambient sound to determine whether the ambient sound is included. voice message.
[0117] 具体的, 语音检测模块 40按帧处理采集到的环境声音, 每帧吋长根据采集的声 音信号特点来设定, 通过语音活动检测算法提取每帧声音信号的参数特征值, 比较参数特征值与门限值的大小。 当参数特征值大于或等于门限值吋, 则判定 该帧为语音帧; 当参数值小于门限值吋, 则判定该帧不是语音帧, 是噪声。  [0117] Specifically, the voice detection module 40 processes the collected ambient sound according to the frame, and the length of each frame is set according to the characteristics of the collected sound signal, and the parameter feature value of each frame of the sound signal is extracted by the voice activity detection algorithm, and the parameter is compared. The size of the eigenvalue and threshold. When the parameter characteristic value is greater than or equal to the threshold value 吋, it is determined that the frame is a speech frame; when the parameter value is less than the threshold value 吋, it is determined that the frame is not a speech frame and is noise.
[0118] 当检测到环境声音中包含语音信息吋, 语音发送模块 50则从环境声音中提取出 语音信息, 并立即向用户终端或服务器发送该语音信息, 服务器接收到语音信 息后, 立即将语音信息推送给指定的用户终端。 优选地, 语音发送模块 50采用 音视频对等网络传输技术 (或称音视频 P2P传输技术) 直接向用户终端发送语音 信息, 使得语音信息不被服务器等第三方获取, 只被用户端获取, 提高了信息 的安全性。 用户终端接收到语音信息后, 可以提示用户播放该语音信息或者直 接播放该语音信息, 该用户终端可以是手机、 平板等移动终端, 也可以是个人 电脑、 笔记本电脑等计算机终端。 从而, 使得用户可以根据该语音信息进一步 了解现场的具体情况, 判断家里是否确实有外人闯入, 提高判断的准确性, 防 止误判。  [0118] When it is detected that the voice information is included in the environment sound, the voice sending module 50 extracts the voice information from the ambient sound, and immediately sends the voice information to the user terminal or the server, and after receiving the voice information, the server immediately transmits the voice information. The information is pushed to the specified user terminal. Preferably, the voice sending module 50 uses the audio-video peer-to-peer network transmission technology (or the audio and video P2P transmission technology) to directly send voice information to the user terminal, so that the voice information is not obtained by a third party such as a server, but is only obtained by the user terminal, thereby improving The security of the information. After receiving the voice information, the user terminal may prompt the user to play the voice information or directly play the voice information. The user terminal may be a mobile terminal such as a mobile phone or a tablet, or may be a computer terminal such as a personal computer or a notebook computer. Therefore, the user can further understand the specific situation on the spot according to the voice information, determine whether the family actually has an intrusion, improve the accuracy of the judgment, and prevent misjudgment.
[0119] 进一步地, 当用户根据智能音箱发送的语音信息确定家里确实有外人闯入吋, 可以通过用户终端向智能音箱发送报警信号, 异常报警模块 30接收到报警信号 吋, 则发出报警声, 以震慑闯入者或者引起附近居民的注意而采取相应的安全 措施。  [0119] Further, when the user determines that there is an outsider breaking into the home according to the voice information sent by the smart speaker, the user terminal may send an alarm signal to the smart speaker, and when the abnormal alarm module 30 receives the alarm signal, an alarm sound is generated. Take appropriate safety measures to shock the entrants or attract the attention of nearby residents.
[0120] 参照图 7, 提出本发明的安全防护装置第三实施例。 本实施例的异常判断模块 2 0如图 8所示, 包括第三判断单元 25和第三判决单元 26, 其中: 第三判断单元 25 , 设置为判断环境声音中是否包含语音信息; 第三判决单元 26, 设置为当环境 声音中包含语音信息吋, 判定环境声音异常。 第三判断单元 25分析判断环境声 音中是否包含语音信息的方式与前述实施例中的语音检测模块 40分析判断方式 相同, 在此不再赘述。 [0120] Referring to Figure 7, a third embodiment of the safety guard of the present invention is presented. The abnormality determining module 2 of this embodiment As shown in FIG. 8, the third determining unit 25 and the third determining unit 26 are included, wherein: the third determining unit 25 is configured to determine whether voice information is included in the ambient sound; and the third determining unit 26 is configured to be an ambient sound. The voice information is included, and the ambient sound is abnormal. The third judging unit 25 analyzes whether the voice information is included in the environment sound, and the manner of analyzing and judging by the voice detecting module 40 in the foregoing embodiment is the same, and details are not described herein again.
[0121] 进一步地, 该装置还包括语音发送模块 50, 其设置为当环境声音异常吋, 向外 发送语音信息。 从而, 使得用户可以根据该语音信息进一步了解现场的具体情 况, 判断家里是否确实有外人闯入, 提高判断的准确性, 防止误判。  Further, the apparatus further includes a voice transmitting module 50 configured to send the voice information outward when the ambient sound is abnormal. Therefore, the user can further understand the specific situation on the spot according to the voice information, determine whether the family actually has an intrusion, improve the accuracy of the judgment, and prevent misjudgment.
[0122] 在某些实施例中, 异常判断模块 20还可以将前述实施例判断环境声音是否异常 的方式结合起来。 例如, 异常判断模块 20首选判断环境声音的音量是否大于或 等于阈值, 若是, 再判断环境声音中是否包含语音信息, 当环境声音中包含语 音信息吋, 则判定环境声音异常。 又如, 异常判断模块 20首选判断环境声音的 音量是否大于或等于阈值; 若是, 再判断是否有连续 N个环境声音的采样点的音 量均大于或等于预设值, 若是, 再判断环境声音中是否包含语音信息, 当环境 声音中包含语音信息吋, 则判定环境声音异常。  [0122] In some embodiments, the abnormality determination module 20 may also combine the manner in which the foregoing embodiment determines whether the ambient sound is abnormal. For example, the abnormality determining module 20 first determines whether the volume of the ambient sound is greater than or equal to the threshold. If yes, it determines whether the ambient sound contains voice information, and when the ambient sound includes the voice information, it determines that the ambient sound is abnormal. For example, the abnormality determining module 20 first determines whether the volume of the ambient sound is greater than or equal to the threshold; if yes, determining whether the volume of the sampling points of the consecutive N ambient sounds is greater than or equal to the preset value, and if so, determining the ambient sound. Whether or not the voice information is included, and when the voice information is included in the ambient sound, it is determined that the environmental sound is abnormal.
[0123] 本领域技术人员可以理解, 除了本实施例列举的上述方式外, 还可以采用现有 技术中的其它方式判断环境声音是否异常, 本发明对此不再一一列举赘述。  It can be understood by those skilled in the art that, besides the above-mentioned manners listed in this embodiment, other manners in the prior art can be used to determine whether the ambient sound is abnormal, and the present invention will not be described again.
[0124] 本发明实施例的安全防护装置, 通过监听环境声音, 当环境声音异常吋, 则发 送报警信息, 从而实现了对家、 办公室等目标场所的远程监听, 实现对目标场 所的安全防护, 提高了目标场所的安保水平。 相对于视频监控防护方案, 本发 明实施例的远程监听防护方案不会暴露目标场所的图像, 因此能够更好的保护 用户隐私, 同吋可以利用现存的智能音箱等智能家居设备实现远程监听, 实现 成本低, 隐蔽性好, 不易被闯入者破坏, 并且避免了图像遮挡、 光照等问题而 影响监控效果, 从而极大的提高了安全防护的稳定性和有效性。  [0124] The security protection device of the embodiment of the present invention transmits an alarm message when the ambient sound is abnormal, by monitoring the ambient sound, thereby realizing remote monitoring of a target location such as a home or an office, thereby realizing security protection against the target site. Improve the security level of the target location. Compared with the video surveillance protection scheme, the remote monitoring protection scheme of the embodiment of the present invention does not expose the image of the target location, so the user privacy can be better protected, and the remote monitoring can be realized by using the smart home device such as the existing smart speaker. The cost is low, the concealment is good, it is not easy to be destroyed by the intruder, and the image occlusion, illumination and the like are avoided, which affects the monitoring effect, thereby greatly improving the stability and effectiveness of the security protection.
[0125] 本发明同吋提出一种智能音箱, 包括存储器、 处理器和至少一个被存储在存储 器中并被配置为由处理器执行的应用程序, 所述应用程序被配置为用于执行安 全防护方法。 所述安全防护方法包括以下步骤: 采集环境声音, 判断环境声音 是否异常, 当环境声音异常吋, 向外发送报警信息。 本实施例中所描述的安全 防护方法为本发明中上述实施例所涉及的安全防护方法, 在此不再赘述。 [0125] The present invention also provides a smart speaker that includes a memory, a processor, and at least one application stored in the memory and configured to be executed by the processor, the application being configured to perform security protection method. The security protection method includes the following steps: collecting an ambient sound, determining whether the ambient sound is abnormal, and sending an alarm message when the ambient sound is abnormal. Security described in this embodiment The protection method is the security protection method in the foregoing embodiment of the present invention, and details are not described herein again.
本领域技术人员可以理解, 本发明包括涉及用于执行本申请中所述操作中的一 项或多项的设备。 这些设备可以为所需的目的而专门设计和制造, 或者也可以 包括通用计算机中的已知设备。 这些设备具有存储在其内的计算机程序, 这些 计算机程序选择性地激活或重构。 这样的计算机程序可以被存储在设备 (例如 , 计算机) 可读介质中或者存储在适于存储电子指令并分别耦联到总线的任何 类型的介质中, 所述计算机可读介质包括但不限于任何类型的盘 (包括软盘、 硬盘、 光盘、 CD-ROM、 和磁光盘) 、 ROM (Read-Only Memory , 只读存储器 ) 、 RAM (Random Access Memory , 随机存储器) 、 EPROM (Erasable Programmable Read-Only  Those skilled in the art will appreciate that the present invention includes apparatus that is directed to performing one or more of the operations described herein. These devices may be specially designed and manufactured for the required purposes, or may also include known devices in a general purpose computer. These devices have computer programs stored therein that are selectively activated or reconfigured. Such computer programs may be stored in a device (eg, computer) readable medium or in any type of medium suitable for storing electronic instructions and respectively coupled to a bus, including but not limited to any Types of disks (including floppy disks, hard disks, CDs, CD-ROMs, and magneto-optical disks), ROM (Read-Only Memory), RAM (Random Access Memory), EPROM (Erasable Programmable Read-Only)
Memory , 可擦写可编程只读存储器) 、 EEPROM (Electrically Erasable Memory, rewritable programmable read only memory), EEPROM (Electrically Erasable
Programmable Read-Only Memory , 电可擦可编程只读存储器) 、 闪存、 磁性卡 片或光线卡片。 也就是, 可读介质包括由设备 (例如, 计算机) 以能够读的形 式存储或传输信息的任何介质。 Programmable Read-Only Memory, Flash, Magnetic Card or Light Card. That is, a readable medium includes any medium that is stored or transmitted by a device (e.g., a computer) in a readable form.
[0127] 本技术领域技术人员可以理解, 可以用计算机程序指令来实现这些结构图和 / 或框图和 /或流图中的每个框以及这些结构图和 /或框图和 /或流图中的框的组合。 本技术领域技术人员可以理解, 可以将这些计算机程序指令提供给通用计算机 、 专业计算机或其他可编程数据处理方法的处理器来实现, 从而通过计算机或 其他可编程数据处理方法的处理器来执行本发明公幵的结构图和 /或框图和 /或流 图的框或多个框中指定的方案。  [0127] Those skilled in the art will appreciate that each block of the block diagrams and/or block diagrams and/or flow diagrams can be implemented by computer program instructions, and/or in the block diagrams and/or block diagrams and/or flow diagrams. The combination of boxes. Those skilled in the art will appreciate that these computer program instructions can be implemented by a general purpose computer, a professional computer, or a processor of other programmable data processing methods, such that the processor is executed by a computer or other programmable data processing method. The block diagrams and/or block diagrams of the invention and/or the schemes specified in the blocks or blocks of the flow diagram are invented.
[0128] 本技术领域技术人员可以理解, 本发明中已经讨论过的各种操作、 方法、 流程 中的步骤、 措施、 方案可以被交替、 更改、 组合或刪除。 进一步地, 具有本发 明中已经讨论过的各种操作、 方法、 流程中的其他步骤、 措施、 方案也可以被 交替、 更改、 重排、 分解、 组合或刪除。 进一步地, 现有技术中的具有与本发 明中公幵的各种操作、 方法、 流程中的步骤、 措施、 方案也可以被交替、 更改 、 重排、 分解、 组合或刪除。  [0128] Those skilled in the art can understand that the various operations, methods, and steps, measures, and solutions in the present invention may be alternated, changed, combined, or deleted. Further, various operations, methods, and other steps, measures, and arrangements in the process of the present invention may be alternated, changed, rearranged, decomposed, combined, or deleted. Further, the steps, measures, and solutions in the various operations, methods, and processes disclosed in the prior art may be alternated, changed, rearranged, decomposed, combined, or deleted.
[0129] 以上参照附图说明了本发明的优选实施例, 并非因此局限本发明的权利范围。  The preferred embodiments of the present invention have been described above with reference to the drawings, and are not intended to limit the scope of the invention.
本领域技术人员不脱离本发明的范围和实质, 可以有多种变型方案实现本发明 , 比如作为一个实施例的特征可用于另一实施例而得到又一实施例。 凡在运用 本发明的技术构思之内所作的任何修改、 等同替换和改进, 均应在本发明的权 利范围之内。 Those skilled in the art can implement the invention without departing from the scope and spirit of the invention. For example, features that are one embodiment may be used in another embodiment to yield yet another embodiment. Any modifications, equivalent substitutions and improvements made within the technical concept of the invention are intended to be included within the scope of the invention.

Claims

权利要求书 Claim
一种安全防护方法, 包括以下步骤: A security protection method, including the following steps:
采集环境声音; Collect ambient sounds;
判断所述环境声音是否异常; Determining whether the ambient sound is abnormal;
当所述环境声音异常吋, 向外发送报警信息。 When the ambient sound is abnormal, an alarm message is sent out.
根据权利要求 1所述的安全防护方法, 其中, 所述判断所述环境声音 是否异常包括: The security protection method according to claim 1, wherein the determining whether the ambient sound is abnormal comprises:
判断所述环境声音的音量是否大于或等于阈值; Determining whether the volume of the ambient sound is greater than or equal to a threshold;
当所述环境声音的音量大于或等于所述阈值吋, 判定所述环境声音异 常。 When the volume of the ambient sound is greater than or equal to the threshold 吋, it is determined that the ambient sound is abnormal.
根据权利要求 1所述的安全防护方法, 其中, 所述判断所述环境声音 是否异常包括: The security protection method according to claim 1, wherein the determining whether the ambient sound is abnormal comprises:
判断所述环境声音的音量是否大于或等于阈值; Determining whether the volume of the ambient sound is greater than or equal to a threshold;
当所述环境声音的音量大于或等于所述阈值吋, 判断是否有连续 N个 所述环境声音的采样点的音量均大于或等于预设值, 其中 N大于或等 于 2; When the volume of the ambient sound is greater than or equal to the threshold 吋, determining whether there is a continuous N volume of the sampling point of the ambient sound is greater than or equal to a preset value, where N is greater than or equal to 2;
若是, 则判定所述环境声音异常。 If so, it is determined that the environmental sound is abnormal.
根据权利要求 3所述的安全防护方法, 其中, 所述判断所述环境声音 是否异常的步骤之后还包括: 当所述环境声音异常吋, 检测所述环境声音中是否包含语音信息; 当所述环境声音中包含语音信息吋, 向外发送所述语音信息。 The security protection method according to claim 3, wherein the step of determining whether the environmental sound is abnormal comprises: detecting whether the environmental sound contains voice information when the ambient sound is abnormal; The ambient sound contains voice information 吋, and the voice information is sent out.
根据权利要求 4所述的安全防护方法, 其中, 所述检测所述环境声音 中是否包含语音信息包括: The security protection method according to claim 4, wherein the detecting whether the voice information is included in the ambient sound comprises:
采用语音活动检测算法对所述环境声音进行吋域和频域特征分析, 判 断所述环境声音中是否包含语音信息。 The voice activity detection algorithm is used to perform the analysis of the domain and frequency domain characteristics of the ambient sound, and it is determined whether the ambient sound contains voice information.
根据权利要求 4所述的安全防护方法, 其中, 所述向外发送所述语音 信息包括: 采用音视频对等网络传输技术向用户终端发送所述语音信 息。 [权利要求 7] 根据权利要求 3所述的安全防护方法, 其中, 所述阈值至少有两个, 不同的吋段对应不同的阈值。 The security protection method according to claim 4, wherein the sending the voice information outward comprises: transmitting the voice information to a user terminal by using an audio-video peer-to-peer network transmission technology. [Claim 7] The security protection method according to claim 3, wherein the threshold has at least two, and different segments correspond to different thresholds.
[权利要求 8] 根据权利要求 3所述的安全防护方法, 其中, 所述预设值至少有两个[Claim 8] The security protection method according to claim 3, wherein the preset value has at least two
, 不同的吋段对应不同的预设值。 Different segments correspond to different preset values.
[权利要求 9] 根据权利要求 1所述的安全防护方法, 其中, 所述判断所述环境声音 是否异常包括: [Claim 9] The security protection method according to claim 1, wherein the determining whether the environmental sound is abnormal includes:
判断所述环境声音中是否包含语音信息;  Determining whether the ambient sound contains voice information;
当所述环境声音中包含语音信息吋, 判定所述环境声音异常。  When the ambient sound contains voice information, it is determined that the ambient sound is abnormal.
[权利要求 10] 根据权利要求 9所述的安全防护方法, 其中, 所述判断所述环境声音 是否异常的步骤之后还包括: [Claim 10] The security protection method according to claim 9, wherein the step of determining whether the environmental sound is abnormal includes:
当所述环境声音异常吋, 向外发送所述语音信息。  When the ambient sound is abnormal, the voice information is sent out.
11.一种安全防护装置, 包括:  11. A safety guard comprising:
声音采集模块, 设置为采集环境声音;  a sound collection module, configured to collect ambient sounds;
异常判断模块, 设置为判断所述环境声音是否异常;  An abnormality determining module, configured to determine whether the ambient sound is abnormal;
异常报警模块, 设置为当所述环境声音异常吋, 向外发送报警信息。  The abnormal alarm module is set to send an alarm message outward when the ambient sound is abnormal.
12.根据权利要求 11所述的安全防护装置, 其中, 所述异常判断模块 包括:  The security protection device according to claim 11, wherein the abnormality determination module comprises:
第一判断单元, 设置为判断所述环境声音的音量是否大于或等于阈值 第一判决单元, 设置为当所述环境声音的音量大于或等于所述阈值吋 , 判定所述环境声音异常。  The first determining unit is configured to determine whether the volume of the ambient sound is greater than or equal to a threshold. The first determining unit is configured to determine that the ambient sound is abnormal when the volume of the ambient sound is greater than or equal to the threshold.
13.根据权利要求 11所述的安全防护装置, 其中, 所述异常判断模块 包括:  The security protection device according to claim 11, wherein the abnormality determining module comprises:
第一判断单元, 设置为判断所述环境声音的音量是否大于或等于阈值 第二判断单元, 设置为当所述环境声音的音量大于或等于所述阈值吋 , 判断是否有连续 N个所述环境声音的采样点的音量均大于或等于预 设值, 其中 N大于或等于 2; 第二判决单元, 设置为当有连续 N个所述环境声音的采样点的音量均 大于或等于预设值吋, 判定所述环境声音异常。 a first determining unit, configured to determine whether the volume of the ambient sound is greater than or equal to a threshold value, the second determining unit is configured to determine whether there are consecutive N of the environment when the volume of the ambient sound is greater than or equal to the threshold value The volume of the sampling point of the sound is greater than or equal to a preset value, where N is greater than or equal to 2; The second determining unit is configured to determine that the ambient sound is abnormal when the volume of the sampling points of the consecutive N environmental sounds is greater than or equal to the preset value.
14.根据权利要求 13所述的安全防护装置, 其中, 所述装置还包括: 语音检测模块, 设置为当所述环境声音异常吋, 检测所述环境声音中 是否包含语音信息;  The security protection device according to claim 13, wherein the device further comprises: a voice detection module, configured to detect whether the ambient sound contains voice information when the ambient sound is abnormal;
语音发送模块, 设置为当所述环境声音中包含语音信息吋, 向外发送 所述语音信息。 The voice sending module is configured to send the voice information to the outside when the voice sound is included in the ambient sound.
15.根据权利要求 14所述的安全防护装置, 其中, 所述语音检测模块 设置为: 采用语音活动检测算法对所述环境声音进行吋域和频域特征 分析, 判断所述环境声音中是否包含语音信息。  The security protection device according to claim 14, wherein the voice detection module is configured to: perform a domain and frequency domain feature analysis on the ambient sound by using a voice activity detection algorithm, and determine whether the environment sound includes voice message.
16.根据权利要求 14所述的安全防护装置, 其中, 所述语音发送模块 设置为: 采用音视频对等网络传输技术向用户终端发送所述语音信息  The security protection device according to claim 14, wherein the voice sending module is configured to: send the voice information to a user terminal by using an audio-video peer-to-peer network transmission technology.
17.根据权利要求 13所述的安全防护装置, 其中, 所述阈值至少有两 个, 不同的吋段对应不同的阈值。 The safety protection device according to claim 13, wherein the threshold has at least two, and different segments correspond to different thresholds.
18.根据权利要求 17所述的安全防护装置, 其中, 所述预设值至少有 两个, 不同的吋段对应不同的预设值。  The security protection device according to claim 17, wherein the preset value has at least two, and different segments correspond to different preset values.
19.根据权利要求 11所述的安全防护装置, 其中, 所述异常判断模块 包括:  The safety protection device according to claim 11, wherein the abnormality determination module comprises:
第三判断单元, 设置为判断所述环境声音中是否包含语音信息; 第三判决单元, 设置为当所述环境声音中包含语音信息吋, 判定所述 环境声音异常。 The third determining unit is configured to determine whether the ambient sound includes voice information; and the third determining unit is configured to determine that the ambient sound is abnormal when the ambient sound includes voice information.
20.—种智能音箱, 包括存储器、 处理器和至少一个被存储在所述存 储器中并被配置为由所述处理器执行的应用程序, 其中, 所述应用程 序被配置为用于执行权利要求 1所述的安全防护方法。  20. A smart speaker comprising a memory, a processor and at least one application stored in the memory and configured to be executed by the processor, wherein the application is configured to execute a claim The safety protection method described in 1.
PCT/CN2017/100232 2017-09-01 2017-09-01 Security protection method and apparatus, and smart speaker WO2019041314A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/100232 WO2019041314A1 (en) 2017-09-01 2017-09-01 Security protection method and apparatus, and smart speaker

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/100232 WO2019041314A1 (en) 2017-09-01 2017-09-01 Security protection method and apparatus, and smart speaker

Publications (1)

Publication Number Publication Date
WO2019041314A1 true WO2019041314A1 (en) 2019-03-07

Family

ID=65524900

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/100232 WO2019041314A1 (en) 2017-09-01 2017-09-01 Security protection method and apparatus, and smart speaker

Country Status (1)

Country Link
WO (1) WO2019041314A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104240438A (en) * 2014-09-01 2014-12-24 百度在线网络技术(北京)有限公司 Method and device for achieving automatic alarming through mobile terminal and mobile terminal
CN104836857A (en) * 2015-05-07 2015-08-12 广东欧珀移动通信有限公司 Anti-theft method based on smart sound box, anti-theft device based on smart sound box, smart sound box and mobile terminal
CN107371085A (en) * 2017-09-01 2017-11-21 深圳市沃特沃德股份有限公司 Safety protecting method, device and intelligent sound box

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104240438A (en) * 2014-09-01 2014-12-24 百度在线网络技术(北京)有限公司 Method and device for achieving automatic alarming through mobile terminal and mobile terminal
CN104836857A (en) * 2015-05-07 2015-08-12 广东欧珀移动通信有限公司 Anti-theft method based on smart sound box, anti-theft device based on smart sound box, smart sound box and mobile terminal
CN107371085A (en) * 2017-09-01 2017-11-21 深圳市沃特沃德股份有限公司 Safety protecting method, device and intelligent sound box

Similar Documents

Publication Publication Date Title
CN107371085B (en) Safety protection method and device and intelligent sound box
US10506411B1 (en) Portable home and hotel security system
US10381021B2 (en) Robust feature extraction using differential zero-crossing counts
US9609442B2 (en) Smart hearing aid
US9721560B2 (en) Cloud based adaptive learning for distributed sensors
US9785706B2 (en) Acoustic sound signature detection based on sparse features
US9412373B2 (en) Adaptive environmental context sample and update for comparing speech recognition
CN101739789A (en) Sound control alarm method and device
CN101494049B (en) Method for extracting audio characteristic parameter of audio monitoring system
CN109298642B (en) Method and device for monitoring by adopting intelligent sound box
US20150066498A1 (en) Analog to Information Sound Signature Detection
US20130335220A1 (en) Alarm Detector and Methods of Making and Using the Same
CN103198838A (en) Abnormal sound monitoring method and abnormal sound monitoring device used for embedded system
CN108597164B (en) Anti-theft method, anti-theft device, anti-theft terminal and computer readable medium
CN106327813A (en) Intelligent voice recognition and alarm method and system thereof
US10595117B2 (en) Annoyance noise suppression
CN204884102U (en) Intelligence speech recognition alarm system
WO2019041314A1 (en) Security protection method and apparatus, and smart speaker
CN204833568U (en) Broken detection device of glass and LED lighting device
KR102034176B1 (en) Emergency Situation Perception Method by Voice Recognition, and Managing Server Used Therein
CN214544552U (en) Law enforcement record appearance of pronunciation acoustic control
WO2019169685A1 (en) Speech processing method and device and electronic device
CN104978810A (en) Glass breakage detection device and method and LED lighting device
CN204645960U (en) A kind of Application on Voiceprint Recognition strongbox
Spadini et al. Sound event recognition in a smart city surveillance context

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17923696

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17923696

Country of ref document: EP

Kind code of ref document: A1