CN112992136A - Intelligent infant monitoring system and method - Google Patents

Publication number
CN112992136A
CN112992136A CN202011491638.1A CN202011491638A CN112992136A CN 112992136 A CN112992136 A CN 112992136A CN 202011491638 A CN202011491638 A CN 202011491638A CN 112992136 A CN112992136 A CN 112992136A
Authority
CN
China
Prior art keywords
unit
unique code
server
local server
voice signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011491638.1A
Other languages
Chinese (zh)
Inventor
吕春苗
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Call Shanghai Cloud Computing Co ltd
Original Assignee
Call Shanghai Cloud Computing Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2020-12-16
Filing date
2020-12-16
Publication date
2021-06-18

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/221 Announcement of recognition results
    • G10L15/28 Constructional details of speech recognition systems
    • G10L15/30 Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/63 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for estimating an emotional state

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Child & Adolescent Psychology (AREA)
  • General Health & Medical Sciences (AREA)
  • Hospice & Palliative Care (AREA)
  • Psychiatry (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention discloses an intelligent infant monitoring system comprising a sound acquisition device, a local server and a third-party server. The sound acquisition device is used for acquiring environmental sound and forming a voice signal. The local server comprises a storage unit, a communication unit, a verification unit and a decision unit: the storage unit is used for reading the sound acquisition device and storing the voice signal; the communication unit is used for uploading the voice signal to the third-party server and receiving feedback data from the third-party server; the verification unit is used for starting the decision unit when the feedback data is a preset unique code; the decision unit is used for dialing a prestored telephone number when the unique code has not expired. The third-party server comprises a receiving unit and an identification unit: the receiving unit is used for receiving the voice signal uploaded by the local server; the identification unit is used for identifying whether the voice signal contains a child's crying and, when it does, sending the preset unique code to the local server as feedback data. The invention can reduce the server load and speed up the response to an infant's crying.

Description

Intelligent infant monitoring system and method
Technical Field
The invention belongs to the technical field of infant care equipment, and particularly relates to an intelligent infant monitoring system and an intelligent infant monitoring method based on the system.
Background
With the advancement of technology, intelligent devices for infant monitoring continue to appear. Existing baby monitors mainly consist of a baby-end monitor and a control-end display. They work as follows: the monitor is placed in the baby's room, and the baby's safety is checked at any time through the display. The monitor has a camera for shooting video of the baby and a built-in microphone; it sends out a signal through its antenna, the display receives the signal, and a parent can watch the baby's movements on the display at the receiving end. The problems with this solution are that decibel-based recognition is slow and the burden on the server is large. How to develop a new intelligent infant monitoring system that overcomes these problems is therefore a direction that those skilled in the art need to research.
Disclosure of Invention
The invention aims to provide an intelligent infant monitoring system which can reduce the burden of a server and accelerate the monitoring response speed of crying and screaming of infants.
The technical scheme is as follows:
an intelligent infant monitoring system, comprising: a sound acquisition device, a local server and a third-party server; the sound acquisition device is used for acquiring environmental sound and forming a voice signal; the local server comprises a storage unit, a communication unit, a verification unit and a decision unit; the storage unit is used for reading the sound acquisition device and storing the voice signal; the communication unit is used for data interaction between the local server and the third-party server, uploading the voice signal to the third-party server and receiving feedback data from the third-party server; the verification unit is used for reading the feedback data and starting the decision unit when the feedback data is a preset unique code; the decision unit is used for checking whether the unique code has expired, and dialing a prestored telephone number through a virtual phone when the unique code has not expired; the third-party server comprises a receiving unit and an identification unit; the receiving unit is used for receiving the voice signal uploaded by the local server; the identification unit is used for identifying whether the voice signal contains a child's crying and, when it does, sending the preset unique code to the local server as feedback data.
Preferably, in the above intelligent infant monitoring system, the sound collection device is a miniature microphone or a decibel meter.
More preferably, in the above intelligent infant monitoring system, the sound collection device is integrated in a smart watch.
The application further discloses an intelligent infant monitoring method based on the above intelligent infant monitoring system.
The technical scheme is as follows:
an intelligent infant monitoring method, comprising the steps of:
S1: the sound acquisition device captures environmental sound to form a voice signal;
S2: the local server reads the sound acquisition device, stores the voice signal obtained in S1 in the storage unit and uploads the voice signal to the third-party server through the communication unit;
S3: the third-party server receives and analyzes the voice signal, and jumps to S4 when the voice signal does not contain a child's crying, and to S5 when it does;
S4: the third-party server outputs a termination signal to the local server, and the verification unit terminates the flow of the local server;
S5: the third-party server extracts a pre-stored unique code and feeds the unique code back to the local server;
S6: the verification unit verifies the unique code of S5; if the unique code has expired, the flow of the local server is terminated; if the unique code has not expired, go to S7;
S7: the local server dials the prestored mobile phone number through the virtual phone.
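The control flow of steps S1 to S7 can be sketched as follows. This is only an illustrative sketch, not the implementation of the application: the names `Feedback`, `third_party_server`, `local_server`, the two signal labels, and the callback parameters (`contains_cry`, `upload`, `code_expired`, `dial`) are all hypothetical, and the cry detector, expiry check and virtual phone are passed in as placeholders.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional

TERMINATION = "termination"   # hypothetical label for the S4 termination signal
UNIQUE_CODE = "unique_code"   # hypothetical label for the S5 unique-code feedback


@dataclass
class Feedback:
    """Feedback data sent from the third-party server to the local server."""
    kind: str
    unique_code: Optional[str] = None


def third_party_server(voice_signal: bytes,
                       contains_cry: Callable[[bytes], bool],
                       stored_code: str) -> Feedback:
    """S3-S5: analyse the voice signal and answer with S4 or S5 feedback."""
    if not contains_cry(voice_signal):
        return Feedback(TERMINATION)               # S4: no crying detected
    return Feedback(UNIQUE_CODE, stored_code)      # S5: return the pre-stored code


def local_server(voice_signal: bytes,
                 storage: List[bytes],
                 upload: Callable[[bytes], Feedback],
                 code_expired: Callable[[Optional[str]], bool],
                 dial: Callable[[str], None],
                 phone_number: str) -> str:
    """S2, S6, S7: store and upload the signal, verify the feedback, then dial."""
    storage.append(voice_signal)                   # S2: storage unit keeps the signal
    feedback = upload(voice_signal)                # S2: communication unit uploads it
    if feedback.kind == TERMINATION:               # S4: verification unit ends the flow
        return "terminated"
    if code_expired(feedback.unique_code):         # S6: an expired code ends the flow
        return "expired"
    dial(phone_number)                             # S7: call the prestored number
    return "dialed"
```

Passing the detector and the dialer as callables keeps the sketch independent of any particular recognition model or telephony service, neither of which is specified in the text.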
Preferably, in the intelligent infant monitoring method, the step S3 includes:
S31: the third-party server receives and analyzes the voice signal to obtain voiceprint features;
S32: the voiceprint features obtained in S31 are matched against the standard feature vectors stored in the voiceprint library; if the matching fails, the voice signal is determined not to contain a child's crying and the flow jumps to S4; if the matching succeeds, the voice signal is determined to contain a child's crying and the flow jumps to S5.
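The matching in S32 can be illustrated with a small sketch. The text does not state which similarity measure is used, so cosine similarity is an assumption here, as are the function names and the default threshold of 0.9; the only element taken from the application is the idea of comparing the extracted features with stored standard feature vectors against a threshold.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two feature vectors (assumed measure)."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a silent frame matches nothing
    return dot / (norm_a * norm_b)


def next_step(features: Sequence[float],
              voiceprint_library: Sequence[Sequence[float]],
              threshold: float = 0.9) -> str:
    """Return "S5" if any standard vector matches, otherwise "S4"."""
    matched = any(cosine_similarity(features, ref) >= threshold
                  for ref in voiceprint_library)
    return "S5" if matched else "S4"
```

A scaled copy of a stored vector counts as a match under this measure, which makes the decision insensitive to overall loudness; whether that is desirable depends on the features actually used.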
More preferably, in the intelligent infant monitoring method:
in step S2, the local server also generates a local timestamp and uploads it together with the voice signal obtained in S1 when uploading to the third-party server through the communication unit;
in step S5, the third-party server returns the local timestamp of S2 to the local server while extracting the pre-stored unique code and feeding it back to the local server;
in step S6, the verification unit compares the local timestamp returned in S5 with the current time to obtain a time difference; if the time difference is greater than or equal to a pre-stored time difference threshold, the unique code is determined to have expired and the flow of the local server is terminated; if the time difference is less than the pre-stored time difference threshold, the unique code is determined not to have expired and the flow proceeds to S7.
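The expiry rule of S6 reduces to a single comparison, sketched below. The function and parameter names are hypothetical, and timestamps are assumed to be seconds since the epoch; the text does not specify the time unit or the threshold value.

```python
import time
from typing import Optional


def unique_code_expired(local_timestamp: float,
                        threshold_seconds: float,
                        now: Optional[float] = None) -> bool:
    """S6: the code is expired when the elapsed time reaches the threshold."""
    current = time.time() if now is None else now
    time_difference = current - local_timestamp
    # "greater than or equal to" the threshold means expired, as stated in S6.
    return time_difference >= threshold_seconds
```

For example, with a timestamp of 100.0, a current time of 105.0 and a threshold of 10.0 seconds, the time difference is 5.0 and the code has not expired, so the flow would continue to S7.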
Compared with the prior art, the invention has the following beneficial effects:
the burden on the server is extremely low: the feedback does not need to carry sound, only the unique code, so the transmission bandwidth is small and the speed is high, which improves the user experience and saves time; the uploaded sound file is typically about 15 s long, so the bandwidth requirement of the upload is not high either.
Drawings
The invention will be described in further detail with reference to the following detailed description and accompanying drawings:
FIG. 1 is a block diagram of the present invention;
FIG. 2 is a flow chart of the operation of the present invention.
Detailed Description
In order to more clearly illustrate the technical solution of the present invention, the following will be further described with reference to various embodiments.
Example 1, as shown in FIGS. 1-2:
an intelligent infant monitoring system, comprising: the system comprises a sound collection device 1, a local server 2 and a third-party server 3;
the sound collection device 1 is used for collecting environmental sound and forming a voice signal; the local server 2 comprises a storage unit, a communication unit, a verification unit and a decision unit; the storage unit is used for reading the sound collection device 1 and storing the voice signal; the communication unit is used for data interaction between the local server 2 and the third-party server 3, uploading the voice signal to the third-party server 3 and receiving feedback data from the third-party server 3; the verification unit is used for reading the feedback data and starting the decision unit when the feedback data is a preset unique code; the decision unit is used for checking whether the unique code has expired, and dialing a prestored telephone number through a virtual phone when the unique code has not expired; the third-party server 3 comprises a receiving unit and an identification unit; the receiving unit is used for receiving the voice signal uploaded by the local server 2; the identification unit is used for identifying whether the voice signal contains a child's crying and, when it does, sending the preset unique code to the local server 2 as feedback data.
In practice, the working process is as follows:
an intelligent infant monitoring method, comprising the steps of:
S1: the sound collection device 1 captures environmental sound to form a voice signal;
S2: the local server 2 reads the sound collection device 1, stores the voice signal obtained in S1 in the storage unit, uploads the voice signal to the third-party server 3 through the communication unit, and at the same time generates and uploads a local timestamp;
S31: the third-party server 3 receives and analyzes the voice signal to obtain voiceprint features;
S32: the voiceprint features obtained in S31 are matched against the standard feature vectors stored in the voiceprint library; if the matching fails, the voice signal is determined not to contain a child's crying and the flow jumps to S4; if the matching succeeds, the voice signal is determined to contain a child's crying and the flow jumps to S5;
S4: the third-party server 3 outputs a termination signal to the local server 2, and the verification unit terminates the flow of the local server 2;
S5: the third-party server 3 extracts the pre-stored unique code and feeds it back to the local server 2, and at the same time returns the local timestamp of S2 to the local server 2; in this example, the unique code is b89d9d70312b64dbd67cca37b35f9f7e;
S6: the verification unit verifies the unique code of S5: the local timestamp returned in S5 is compared with the current time to obtain a time difference; if the time difference is greater than or equal to a pre-stored time difference threshold, the unique code is determined to have expired and the flow of the local server 2 is terminated; if the time difference is less than the pre-stored time difference threshold, the unique code is determined not to have expired and the flow jumps to S7;
S7: the local server 2 dials the prestored mobile phone number through the virtual phone.
In the above steps, the voice signal is parsed as follows. Human voice production can be regarded as a linear system in which airflow from the lungs passes through the vocal cords. If $e(t)$ denotes the input excitation (pitch) of the sound and $h(t)$ denotes the response of the vocal cords (i.e. the feature to be obtained), the heard speech signal is $x(t) = e(t) * h(t)$, i.e. the convolution of the two. In the frequency domain this becomes the product $X(\omega) = E(\omega)\,H(\omega)$. Frequency-domain analysis usually considers only the energy of the spectrum and ignores its phase information, i.e. $|X(\omega)| = |E(\omega)|\,|H(\omega)|$. Taking the logarithm of the spectrum then gives $\log|X(\omega)| = \log|E(\omega)| + \log|H(\omega)|$. Finally, an inverse Fourier transform yields the cepstrum coefficients: $c(n) = \mathrm{IDFT}(\log|X(\omega)|) = \mathrm{IDFT}(\log|E(\omega)| + \log|H(\omega)|)$.
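The cepstrum derivation above can be checked numerically with a short sketch. It is illustrative only: the function names are hypothetical, a naive O(N²) DFT from the standard library stands in for whatever transform the system actually uses, and circular convolution is used in place of linear convolution so that the product relation holds exactly for a short frame. The `floor` parameter is an added guard against taking the logarithm of zero.

```python
import cmath
import math
from typing import List, Sequence


def dft(x: Sequence[complex]) -> List[complex]:
    """Naive discrete Fourier transform."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum: Sequence[complex]) -> List[complex]:
    """Naive inverse discrete Fourier transform."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n)
                for k in range(n)) / n
            for t in range(n)]


def real_cepstrum(frame: Sequence[float], floor: float = 1e-12) -> List[float]:
    """c(n) = IDFT(log|X(w)|): the phase is discarded, as in the text."""
    log_magnitude = [math.log(max(abs(v), floor)) for v in dft(frame)]
    return [c.real for c in idft(log_magnitude)]


def circular_convolve(e: Sequence[float], h: Sequence[float]) -> List[float]:
    """x = e * h with wrap-around, so that X(w) = E(w) H(w) holds exactly."""
    n = len(e)
    return [sum(e[m] * h[(t - m) % n] for m in range(n)) for t in range(n)]
```

Because the logarithm turns the product into a sum and the inverse transform is linear, the cepstrum of `circular_convolve(e, h)` equals the element-wise sum of the cepstra of `e` and `h`, up to floating-point error. This additivity is what allows the excitation and the vocal response to be separated in the cepstral domain.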
The above description is only an embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention. The protection scope of the present invention is subject to the protection scope of the claims.

Claims (6)

1. An intelligent infant monitoring system, comprising: the system comprises a sound collection device (1), a local server (2) and a third-party server (3);
the sound collection device (1) is used for collecting environmental sound, forming a voice signal and uploading the voice signal to the local server (2) in a binary form;
the local server (2) comprises a storage unit, a communication unit, a verification unit and a decision unit;
the storage unit is used for storing the voice signals uploaded by the sound acquisition device (1);
the communication unit is used for sending the voice signals uploaded by the sound acquisition device (1) to the third-party server (3) and receiving feedback signals of the third-party server (3); the feedback signal of the third party server (3) is a termination signal or a unique coding signal;
the checking unit is used for reading the feedback signal, terminating the flow when the feedback signal is a termination signal, and starting the decision unit when the feedback signal is a unique code;
the decision unit is used for checking the unique code when started, and automatically dialing a prestored telephone number through the virtual phone if the unique code has not expired;
the third-party server (3) comprises a receiving unit and an identification unit; the receiving unit is used for receiving the voice signals uploaded by the local server (2); a standard feature vector and a unique code are prestored in the identification unit, and the identification unit is used for analyzing voiceprint features in the voice signals, matching the voiceprint features with the standard feature vector, and sending the prestored unique code as feedback data to the local server (2) when the similarity between the voiceprint features and the standard feature vector reaches a threshold value.
2. The intelligent infant monitoring system of claim 1, wherein: the sound collection device (1) adopts a miniature microphone or a decibel meter.
3. The intelligent infant monitoring system of claim 2, wherein: the sound collection device (1) is integrated in a smart watch.
4. An intelligent infant monitoring method is characterized by comprising the following steps:
S1: the sound collection device (1) captures environmental sound to form a voice signal;
S2: the local server (2) reads the sound collection device (1), stores the voice signal obtained in S1 in a storage unit and uploads the voice signal to the third-party server (3) through a communication unit;
S3: the third-party server (3) receives and analyzes the voice signal, and jumps to S4 when the voice signal does not contain a child's crying, and to S5 when it does;
S4: the third-party server (3) outputs a termination signal to the local server (2), and the verification unit terminates the flow of the local server (2);
S5: the third-party server (3) extracts the pre-stored unique code and feeds the unique code back to the local server (2);
S6: the verification unit verifies the unique code of S5; if the unique code has expired, the flow of the local server (2) is terminated; if the unique code has not expired, go to S7;
S7: the local server (2) dials the prestored mobile phone number through the virtual phone.
5. The intelligent infant monitoring method of claim 4, wherein the step S3 includes:
S31: the third-party server (3) receives and analyzes the voice signal to obtain voiceprint features;
S32: the voiceprint features obtained in S31 are matched against the standard feature vectors stored in the voiceprint library; if the matching fails, the voice signal is determined not to contain a child's crying and the flow jumps to S4; if the matching succeeds, the voice signal is determined to contain a child's crying and the flow jumps to S5.
6. The intelligent infant monitoring method of claim 5, wherein:
in step S2, the local server (2) also generates a local timestamp and uploads it together with the voice signal obtained in S1 when uploading to the third-party server (3) through the communication unit;
in step S5, the third-party server (3) extracts the pre-stored unique code and feeds it back to the local server (2), and at the same time returns the local timestamp of S2 to the local server (2);
in step S6, the verification unit compares the local timestamp returned in S5 with the current time to obtain a time difference; if the time difference is greater than or equal to a pre-stored time difference threshold, the unique code is determined to have expired and the flow of the local server (2) is terminated; if the time difference is less than the pre-stored time difference threshold, the unique code is determined not to have expired and the flow proceeds to S7.
CN202011491638.1A 2020-12-16 2020-12-16 Intelligent infant monitoring system and method Pending CN112992136A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011491638.1A CN112992136A (en) 2020-12-16 2020-12-16 Intelligent infant monitoring system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011491638.1A CN112992136A (en) 2020-12-16 2020-12-16 Intelligent infant monitoring system and method

Publications (1)

Publication Number Publication Date
CN112992136A true CN112992136A (en) 2021-06-18

Family

ID=76345022

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011491638.1A Pending CN112992136A (en) 2020-12-16 2020-12-16 Intelligent infant monitoring system and method

Country Status (1)

Country Link
CN (1) CN112992136A (en)

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA1326510C (en) * 1988-05-12 1994-01-25 Jeffrey Scott Childress Trunked radio repeater system
EP2615773A1 (en) * 2012-01-10 2013-07-17 Thomson Licensing Method and device for timestamping data and method and device for verification of a timestamp
CN103702072A (en) * 2013-12-11 2014-04-02 乐视致新电子科技(天津)有限公司 Visual terminal-based monitoring method and visual terminal
CN104104927A (en) * 2014-07-30 2014-10-15 天津三星电子有限公司 Infant monitoring and reminding method and system based on television
CN104935555A (en) * 2014-03-20 2015-09-23 华为技术有限公司 Client certificate authentication method, server, client and system
CN106257480A (en) * 2015-06-05 2016-12-28 北京京东尚科信息技术有限公司 A kind of method and device preventing the robot tool malicious access page
CN107591162A (en) * 2017-07-28 2018-01-16 南京邮电大学 Sob recognition methods and intelligent safeguard system based on pattern match
US20180039990A1 (en) * 2016-08-05 2018-02-08 Nok Nok Labs, Inc. Authentication techniques including speech and/or lip movement analysis
CN109509484A (en) * 2018-12-25 2019-03-22 科大讯飞股份有限公司 A kind of prediction technique and device of baby crying reason
CN109658953A (en) * 2019-01-12 2019-04-19 深圳先进技术研究院 A kind of vagitus recognition methods, device and equipment
CN110223697A (en) * 2019-06-13 2019-09-10 苏州思必驰信息科技有限公司 Interactive method and system
CN110758241A (en) * 2019-08-30 2020-02-07 华为技术有限公司 Occupant protection method and apparatus

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
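Du Zhongping et al.: "Design of an intelligent infant monitoring system based on audio monitoring" (基于音频监控的婴儿智能监护系统设计), Computer Measurement & Control (《计算机测量与控制》) *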

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20210618