WO2019127088A1 - Snore recognition method and snore-stopping device - Google Patents

Snore recognition method and snore-stopping device Download PDF

Info

Publication number
WO2019127088A1
WO2019127088A1 PCT/CN2017/118950 CN2017118950W WO2019127088A1 WO 2019127088 A1 WO2019127088 A1 WO 2019127088A1 CN 2017118950 W CN2017118950 W CN 2017118950W WO 2019127088 A1 WO2019127088 A1 WO 2019127088A1
Authority
WO
WIPO (PCT)
Prior art keywords
sequence
click
signal
window function
audio
Prior art date
Application number
PCT/CN2017/118950
Other languages
French (fr)
Chinese (zh)
Inventor
齐奇
Original Assignee
深圳和而泰数据资源与云技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳和而泰数据资源与云技术有限公司 filed Critical 深圳和而泰数据资源与云技术有限公司
Priority to CN201780009009.0A priority Critical patent/CN108697328B/en
Priority to PCT/CN2017/118950 priority patent/WO2019127088A1/en
Publication of WO2019127088A1 publication Critical patent/WO2019127088A1/en

Links

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/48Other medical applications
    • A61B5/4806Sleep evaluation
    • A61B5/4818Sleep apnoea
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/08Detecting, measuring or recording devices for evaluating the respiratory organs
    • A61B5/0826Detecting or evaluating apnoea events
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/68Arrangements of detecting, measuring or recording means, e.g. sensors, in relation to patient
    • A61B5/6801Arrangements of detecting, measuring or recording means, e.g. sensors, in relation to patient specially adapted to be attached to or worn on the body surface
    • A61B5/6802Sensor mounted on worn items
    • A61B5/681Wristwatch-type devices
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/72Signal processing specially adapted for physiological signals or for diagnostic purposes
    • A61B5/7203Signal processing specially adapted for physiological signals or for diagnostic purposes for noise prevention, reduction or removal
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/72Signal processing specially adapted for physiological signals or for diagnostic purposes
    • A61B5/7235Details of waveform analysis
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/72Signal processing specially adapted for physiological signals or for diagnostic purposes
    • A61B5/7235Details of waveform analysis
    • A61B5/7253Details of waveform analysis characterised by using transforms
    • A61B5/7257Details of waveform analysis characterised by using transforms using Fourier transforms
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61FFILTERS IMPLANTABLE INTO BLOOD VESSELS; PROSTHESES; DEVICES PROVIDING PATENCY TO, OR PREVENTING COLLAPSING OF, TUBULAR STRUCTURES OF THE BODY, e.g. STENTS; ORTHOPAEDIC, NURSING OR CONTRACEPTIVE DEVICES; FOMENTATION; TREATMENT OR PROTECTION OF EYES OR EARS; BANDAGES, DRESSINGS OR ABSORBENT PADS; FIRST-AID KITS
    • A61F5/00Orthopaedic methods or devices for non-surgical treatment of bones or joints; Nursing devices; Anti-rape devices
    • A61F5/56Devices for preventing snoring

Definitions

  • the present application relates to a health monitoring device, and more particularly to a snoring recognition method and a snoring device.
  • OSA obstructive sleep apnea
  • soft tissue relaxation near the throat causes upper airway obstruction, and narrowing of the airway leads to apnea during sleep.
  • OSAHS obstructive sleep apnea hypopnea syndrome (obstructive sleep Apnea-hypopnea syndrome).
  • OSAHS refers to apnea and hypopnea caused by airway collapse obstruction during sleep, accompanied by snoring, sleep structure disorder, frequent occurrence of oxygen saturation, and daytime sleepiness.
  • Apnea refers to the cessation of nasal and nasal airflow during sleep ⁇ 10s.
  • Hypnea refers to the reduction of respiratory airflow during sleep more than 30% above the baseline level, with 3% oxygen saturation (SaO) or arousal.
  • Sleep apnea syndrome more than 30 apneas occurred during 7 hours of sleep, each time the airflow is stopped for more than 10s (including 10s), or the average number of sleep apnea hypopneas (respiratory disorder index) exceeds 5 times.
  • Clinical syndrome that causes chronic hypoxemia and hypercapnia Can be divided into central type, blocking type and mixed type.
  • Snoring is the early stage of sleep apnea syndrome (obstructive sleep apnea), which belongs to different periods of the same disease. If you do not promptly intervene, you will develop sleep apnea syndrome in a few decades.
  • Existing anti-snagging devices include a controller, a click recognition module, a judging module, and a reminder mechanism.
  • the humming recognition module monitors the snoring condition in real time while the user is asleep, collects ambient audio and recognizes snoring. After the buzz is recognized, the snoring information is sent to the controller, and the judging module of the controller drives the reminding mechanism to move when the snoring is determined to be snoring, and touches the user's body to stop the snoring.
  • HMM Hidden Markov Model
  • the application provides a snoring recognition method and a snoring device, and the snoring recognition is accurate, and the calculation amount of the snoring device is greatly reduced, and can adapt to the demand for miniaturization of the stagnation device, including the shackle bracelet, the snoring earphone, the snoring pillow, and the like. .
  • the embodiment of the present application provides a method for identifying a click, including:
  • the window function is framed, and the M1 sequence obtained by the frame is Fourier transformed into the frequency domain, and the low frequency energy of the frequency domain spectrum is summed;
  • the low-frequency energy and value form an N-sequence in the time domain, and the N-sequence is pre-judged, and the qualified N-sequence forms a M2 sequence;
  • the M2 sequence is Fourier transformed to the frequency domain, and the preset chirp parameters are matched according to the spectral peak characteristics of the M2 sequence, and the monitoring chirp signal is output.
  • the method for identifying the click sound further includes: the audio frame is moved by the first-in first-out frame at a 1/X rate, and X is an arbitrary number; wherein the method further includes an audio signal pre-emphasis process.
  • the beating pre-judging of the pair of N sequences comprises: calculating a maximum minimum value and a maximum difference value of the N-sequence data; wherein the parameter predicted by the click sound comprises a zero-crossing rate, a waveform amplitude, and a zero-crossing gap.
  • the method further comprises: normalizing the M2 sequence and truncating the average processing.
  • the window function is a Hanning window function (Haning) or a Hamming window function (Hamming) or a Blackman window function (Blackman).
  • the embodiment of the present application further provides a stagnation device, including a microcontroller, an analog-to-digital conversion circuit (ADC), and a sound pickup circuit, the sound pickup circuit is configured to sample a click audio signal, and the analog-to-digital conversion circuit And performing analog-to-digital (ADC) conversion on the sampling signal, further comprising: a click recognition module, configured to form an audio frame based on the time domain and the sampled signal, perform a click analysis on the audio frame shift frame; Function framing, transforming the M1 sequence obtained by the framing into the frequency domain, and summing the low-frequency energy of the frequency domain spectrum; for the low-frequency energy and value to form the N-sequence in the time domain, and the frequency-domain N-sequence By pre-judging, the pre-determined N sequence forms an M2 sequence; it is also used to perform Fourier transform on the M2 sequence to the frequency domain, and matches the preset chirp parameters according to the spectral peak characteristics of the M
  • the click recognition module is further configured to move the frame to the audio frame at a 1/X rate, where X is an arbitrary number; the click recognition module is further configured to pre-emphasize the audio signal.
  • the click recognition module is further configured to: when the sound is predicted, calculate a maximum minimum value and a maximum difference value of the N sequence data; wherein the parameters of the click sound prediction include a zero crossing rate, a waveform amplitude, and a zero crossing gap. .
  • the click recognition module is further configured to perform normalization processing and truncation averaging processing on the M2 sequence.
  • the window function is a Hanning window function (Haning) or a Hamming window function (Hamming) or a Blackman window function (Blackman).
  • the embodiment of the present application further provides an electronic device, including:
  • At least one processor and,
  • the memory stores instructions executable by the at least one processor, the instructions being executed by the at least one processor to enable the at least one processor to perform the method as described above.
  • the embodiment of the present application further provides a non-transitory computer readable storage medium, where the computer readable storage medium stores computer executable instructions for causing a computer to execute the above The method described.
  • the embodiment of the present application further provides a computer program product, where the computer program product includes a computer program stored on a non-transitory computer readable storage medium, the computer program includes program instructions, when When the program instructions are executed by the computer, the computer is caused to perform the method as described above.
  • the method and device for detecting the click sound provided by the embodiment of the present application divides the click recognition after the window function is divided into the energy differentiation stage and the click characteristic comparison stage.
  • the recognition model is simplified, and the calculation amount is greatly reduced, which can be realized.
  • Miniaturization and portable design of anti-snoring products at the same time, in a quiet sleep environment, 100% accurate beep monitoring can be achieved for buzzing above 60 decibels.
  • FIG. 1 is a flowchart of an embodiment of a method for identifying a click sound provided by an embodiment of the present application
  • FIG. 2 is a product module diagram of a stagnation device provided by an embodiment of the present application.
  • FIG. 3 is a schematic diagram of the action of band pass filtering of the click recognition method provided by the embodiment of the present application.
  • FIG. 5 is a general flowchart of a method for identifying a click sound provided by an embodiment of the present application
  • FIG. 6 is a schematic diagram of the energy summation of the M1 sequence in the click recognition method provided by the embodiment of the present application.
  • FIG. 7 is a schematic diagram of a click recognition of an M2 sequence in a click recognition method provided by an embodiment of the present application.
  • FIG. 9 is a schematic diagram showing the hardware structure of an electronic device for performing a click recognition method according to an embodiment of the present application.
  • the present application relates to a click recognition method and a stop device using the same.
  • the snoring recognition method and device divides the snoring recognition into an energy accumulation phase and a snoring feature comparison phase, and the snoring recognition model is simplified, the calculation amount is greatly reduced, and the miniaturization and portable design of the stagnation product can be realized; meanwhile, in a quiet sleep environment For a beep of more than 60 decibels, 100% accurate bark monitoring can be achieved.
  • the energy accumulation phase and the click characteristic comparison phase are all completed by the click recognition module.
  • the stagnation device of the present embodiment includes a microcontroller 10, an analog-to-digital conversion circuit (ADC) 30, a click recognition module 40, a sound pickup circuit 20, and a stop mechanism. 50.
  • ADC stands for Analog to Digital Converter.
  • the microcontroller 10 controls the operation of the entire stagnation device.
  • the sound pickup circuit 20 samples the click audio signal.
  • the sound pickup circuit 20 acquires a waveform signal (analog signal) of the click sound through the microphone and its associated circuit.
  • the analog to digital conversion circuit performs analog to digital (ADC) conversion on the sampled signal.
  • the anti-snoring mechanism 50 can be implemented in various ways.
  • the anti-snoring mechanism should be designed to be small and the wake-up power is moderate.
  • the size and the pressure awakening force of the stagnation mechanism are relatively larger than the design.
  • the connection to the microcontroller 10 can include a motor, a gear set, and a touch-pressure structure that transmits power from the motor and the gear set under the command of the microcontroller 10, through the touch-pressure structure to the detected sleep user. Perform a wake-up action.
  • the click recognition module forms an audio frame based on the time domain and the sampled signal.
  • the click recognition module performs a click analysis on the audio frame shift frame.
  • the click recognition module also advances the first-in-first-out frame of the audio frame at a rate of 1/X (where X is an arbitrary number).
  • X is an arbitrary number.
  • the value of X is 2.
  • the click recognition module selects the window function to divide the frame, and then transforms the M1 sequence obtained by the frame into the frequency domain, and sums the low frequency energy of the frequency domain spectrum to complete the low frequency energy accumulation phase of the click recognition.
  • the click recognition module combines the low frequency energy and the value into an N sequence in the time domain, and then completes the click prediction of the N sequence, and predicts that the qualified back N sequence forms the M2 sequence.
  • the click recognition module further performs Fourier transform on the M2 sequence to the frequency domain, and matches the preset click parameters according to the spectral peak characteristics of the M2 sequence, and outputs a monitoring click signal of whether the sleep user is snoring.
  • the microcontroller 10 After receiving the monitoring click signal, the microcontroller 10 activates the stop mechanism 50 to touch the monitored user and wake up the sleep user to stop snoring.
  • the click recognition module of the embodiment of the present application is simple and fast to process audio signals, and only band pass filter calculation, windowing calculation, and fast Fourier transform (FFT) occupy relatively more computing resources, but the above calculation can be implemented in a common MCU. .
  • FFT fast Fourier transform
  • the click recognition module starts the click analysis after the frame is moved at the 1/X rate.
  • the value of X is 2. That is, the data M1 sequence is 1/2 frame shifted, assuming that the length of the M1 sequence is m, that is, the M1 sequence of length m is sequentially shifted by m/2 length.
  • the microcontroller 10 converts the sound waveform signal (analog signal) acquired by the sound pickup circuit 20 into a digital signal through an analog-to-digital conversion circuit 30 (ADC), and then performs the click recognition calculation and processing by the click recognition module 40.
  • ADC analog-to-digital conversion circuit 30
  • the click recognition module 40 calculates the maximum and minimum values of the N-series data during the pre-judgment in order to facilitate the high-frequency determination. In addition, the click prediction is performed.
  • the parameters used in the steps include the zero-crossing rate, the amplitude of the waveform, and the zero-crossing gap, eliminating the signal that is initially judged to be non-squeaky.
  • the click recognition module 40 is further configured to perform normalization processing and truncation averaging processing on the M2 sequence.
  • the window function used in this application may be a Hanning window function or a Hamming window function or a Blackman window function (Blackman).
  • the framing signal is processed by a Hanning window to obtain a short-time audio M1 sequence
  • the length of the short-time audio M1 sequence is m, multiplied by the element m by the Hanning window.
  • the coefficient of the filter can be considered to be replaced with other window functions.
  • FFT Fast Fourier Transform
  • FIG. 1 and FIG. 5 together, a main flowchart and a detailed flowchart of an embodiment of a method for identifying a click sound according to an embodiment of the present application are shown.
  • the click recognition method is divided into an energy discrimination phase 100 and a click feature comparison phase 300, wherein a click prediction step 200 is set in the middle of the energy discrimination phase 100 and the click feature comparison phase 300. Both the energy discrimination phase 100 and the click feature comparison phase 300 are performed in the frequency domain.
  • the specific method is as follows:
  • Step S101 sampling the chirp audio signal.
  • the chirp signal sampling uses a hardware microphone and a related circuit;
  • Step S103 Perform analog-to-digital (ADC) conversion on the sampling signal, and set the analog-to-digital conversion circuit 30 to perform analog-to-digital (ADC) conversion on the sampling signal;
  • Step S106 forming an audio frame based on the time domain and the sampling signal, shifting the frame of the audio frame, and starting the energy discrimination phase 100 of the click analysis.
  • the window function is framed, wherein the frame of the short-time audio data M1 is obtained by moving the frame of the audio frame and adopting a window function, and the framed signal is processed by Hanning window.
  • the length of the short-term audio M1 sequence is m, multiplied by the element m by the coefficient of the Hanning window filter.
  • This window function can be replaced by other window functions, and there is no spectrum leakage when fast Fourier transform (FFT) calculation is required.
  • FFT fast Fourier transform
  • the click recognition method includes performing band pass filtering on the sampled audio frame.
  • the frame shift in the step S106 is for an audio frame, and the audio frame is FIFO framed at a 1/X rate; the short-time audio M1 sequence can be 1/2 framed or 1/x framed (where x For any number), the data needs to be met continuously for the first-in, first-out move operation.
  • the method further comprises pre-emphasizing the audio signal of the sequence.
  • the click recognition module is also used to pre-emphasize the audio signal.
  • the pre-emphasis processing of the audio signal uses a digital high-pass filter for boosting the amplitude of the high-frequency portion of the speech signal to flatten the spectrum.
  • Step S108 Fourier transforming the M1 sequence subjected to pre-emphasis processing to the frequency domain, and summing the low-frequency energy of the frequency domain spectrum; refer to the energy summation diagram of the M1 sequence shown in FIG.
  • the energy is mainly concentrated in the low frequency part, and the long-term energy signal sequence is composed by acquiring the short-time low-frequency signal energy.
  • the M1 sequence is subjected to Fast Fourier Transform (FFT), and a short-time audio waveform sequence (a sequence of m points) is subjected to FFT conversion of m points to obtain a spectrum distribution result of short-time audio.
  • FFT Fast Fourier Transform
  • a short-time audio waveform sequence (a sequence of m points) is subjected to FFT conversion of m points to obtain a spectrum distribution result of short-time audio.
  • calculating the low frequency energy sum value T is summing the low frequency energy of the spectrum distribution of the short time audio M1 sequence.
  • Step S110 The low frequency energy and the value T form an N sequence in the time domain. Specifically, the calculated low frequency energy T and the value data are stored in the N sequence of the N length to obtain a long time N sequence of the audio information. Before the click characteristic phase, the N sequence is pre-determined 200, and the qualified N sequence forms an M2 sequence that enters the click characteristic phase to make the final click judgment.
  • the click prediction step 200 includes:
  • the long-time N-sequence is filtered to remove DC processing to remove the apparently non-squeaky energy signal, wherein the N-sequence filtering DC processing is used to filter the DC component in the N-sequence data to obtain an alternating sequence.
  • the maximum and minimum calculation of the long-time N-sequence data and the maximum difference calculation wherein the N-sequence maximum, minimum calculation, and maximum difference calculation can be performed by first obtaining the maximum value of the N-sequence. The minimum value is then calculated as the difference between the maximum and minimum values;
  • the squeak pre-judgment also includes the elimination of irregular audio data, high-noise noise such as car whistle, etc., and the culling pre-judgment parameters include zero-crossing rate, waveform amplitude, and zero-crossing gap.
  • Step S112 Predetermine the qualified N sequence to form the M2 sequence, and enter the click feature comparison phase 300.
  • the click recognition method further includes: performing normalization processing and truncation averaging processing on the M2 sequence.
  • the N-sequence normalization process, the truncated averaging process normalizes the element amplitude values of the sequence N to a uniform data value size interval by the maximum value and the minimum value.
  • the data is subjected to averaging processing of the short sequence n (n ⁇ N / 4).
  • the normalized and averaged sequence is the M2 sequence.
  • the normalization process and the truncated averaging process have little effect on the recognition calculation result, and the normalization process and the truncation averaging process step may not be set.
  • the window function of the normalization processing and the truncated averaging processing also adopts the Hanning window function (Hanning) as described above.
  • Step S114 Fourier transforming the M2 sequence to the frequency domain, and matching the preset chirp parameter according to the spectral peak characteristic of the M2 sequence, and outputting the monitoring chirp signal to the anti-snoring device.
  • the M1 sequence is subjected to Fast Fourier Transform (FFT), and the M1 point fast Fourier transform is performed based on the M1 sequence or the N sequence (when the pre-judging step is set), and the spectrum distribution result of the long-term audio is obtained.
  • FFT Fast Fourier Transform
  • the information of the three largest spectral peaks (P1, P2, P3) in the spectral distribution is recorded. If the three spectral peaks (P1, P2, P3) satisfy the determination condition, it is determined that the long-term audio is Beep, and output a beep signal; otherwise, it is recognized as a non-beep, and then a non-beep signal is output.
  • FIG. 8 is a flow chart of the click characteristic comparison of the click recognition method.
  • the final screening of the click is different according to the characteristics of the click.
  • the subsequent judgment process of the three points (P1, P2, P3) with the largest peak of the spectrum found from the M2 sequence is as follows:
  • the squeaking characteristic is further determined, and it is further determined whether the frequency greater than the multiple 2 meets the breathing rate, and the squeak is determined when the breathing rate is met; when the breathing rate is not met, the sound is determined to be non-squeaking;
  • the single frequency point multiple is less than 2, it does not meet the snoring characteristics, and it can be further determined whether the three frequency points all correspond to the breathing rate, and the squeak is determined when the breathing rate is met; when the breathing rate is not met, the sound is determined to be non-squeaking.
  • the above embodiment is judged by the maximum three spectral peaks, and the determination rule can be specifically determined based on single or multiple spectral peaks.
  • the snoring recognition method and the snoring device of the present application divide the snoring recognition into an energy accumulation phase and a snoring feature comparison phase, and the snoring recognition model is simplified, the calculation amount is greatly reduced, and the miniaturization and portable design of the stagnation product can be realized while ensuring high
  • the detection accuracy of the click sound; the method and device for detecting the click sound accurately recognizes the click sound with high signal to noise, and the spectrum energy of the click sound is mainly concentrated in the low frequency part, and the long-term energy signal sequence is composed by acquiring the short-time low-frequency signal energy and then the long-term energy is generated. The sequence is subjected to spectrum analysis to obtain whether the long-term audio is a click sound, and the click recognition accuracy is high.
  • FIG. 9 is a schematic diagram of the hardware structure of the electronic device 600 according to the method for identifying the click sound provided by the embodiment of the present application. As shown in FIG. 9, the electronic device 600 includes:
  • microcontroller 610 One or more of the microcontroller 610 and the memory 620, a microcontroller 610 is exemplified in FIG.
  • the microcontroller 610 and the memory 620 can be connected by a bus or other means, as exemplified by a bus connection in FIG.
  • the memory 620 is used as a non-volatile computer readable storage medium, and can be used for storing a non-volatile software program, a non-volatile computer-executable program, and a module, such as a program instruction corresponding to the click recognition method in the embodiment of the present application. / Module (for example, the click recognition module 40 shown in Figure 2).
  • the microcontroller 610 performs various function applications of the terminal device or the server and completes data processing by executing non-volatile software programs, instructions, and modules stored in the memory 620, that is, implementing the click recognition method of the above method embodiments.
  • the memory 620 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created according to the use of the click recognition device, and the like.
  • memory 620 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device.
  • memory 620 can optionally include memory remotely located relative to microcontroller 610, which can be connected to the click recognition device via a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • the one or more modules are stored in the memory 620, and when executed by the one or more microcontrollers 610, perform a click recognition method in any of the above method embodiments, for example, performing the above described FIG.
  • the electronic device of the embodiment of the present application exists in various forms, including but not limited to:
  • Mobile communication devices These devices are characterized by mobile communication functions and are mainly aimed at providing voice and data communication.
  • Such terminals include: smart phones (such as iPhone), multimedia phones, functional phones, and low-end phones.
  • Ultra-mobile personal computer equipment This type of equipment belongs to the category of personal computers, has computing and processing functions, and generally has mobile Internet access.
  • Such terminals include: PDAs, MIDs, and UMPC devices, such as the iPad.
  • Server A device that provides computing services.
  • the server consists of a microcontroller, a hard disk, a memory, a system bus, etc.
  • the server is similar to a general-purpose computer architecture, but because of the need to provide highly reliable services, it is capable of processing and stable. Sex, reliability, security, scalability, manageability, etc. are highly demanding.
  • Embodiments of the present application provide a non-transitory computer readable storage medium storing computer-executable instructions that are executed by one or more microcontrollers, such as FIG.
  • One of the microcontrollers 610 can cause the one or more microcontrollers to perform the click recognition method in any of the above method embodiments, for example, to perform the method steps 101 to S114 in FIG. 1 described above, FIG. In steps 100 through 300 of the method, the functions of the modules 31-34 in FIG. 4 are implemented, and the functions of the click recognition module 40 in FIG. 2 are implemented.
  • the device embodiments described above are merely illustrative, wherein the units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, ie may be located A place, or it can be distributed to multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • the storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (Random Access). Memory, RAM), etc.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Public Health (AREA)
  • Animal Behavior & Ethology (AREA)
  • Physics & Mathematics (AREA)
  • Biomedical Technology (AREA)
  • Heart & Thoracic Surgery (AREA)
  • Veterinary Medicine (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Pathology (AREA)
  • Molecular Biology (AREA)
  • Surgery (AREA)
  • Biophysics (AREA)
  • Physiology (AREA)
  • Signal Processing (AREA)
  • Psychiatry (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Pulmonology (AREA)
  • Orthopedic Medicine & Surgery (AREA)
  • Mathematical Physics (AREA)
  • Otolaryngology (AREA)
  • Vascular Medicine (AREA)
  • Nursing (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)

Abstract

A snore recognition method and a snore-stopping device. The method comprises: sampling a snore audio signal; carrying out analog-to-digital (ADC) conversion on the sampled signal; forming an audio frame based on a time domain and the sampled signal, and carrying out frame shifting on the audio frame to carry out snore analysis; carrying out window function framing, and carrying out a Fourier transform on an M1 sequence obtained by means of framing to transform same into a frequency domain, and summating the low-frequency energy of the frequency domain spectrum; forming, by means of the summated value of the low-frequency energy, an N sequence in the time domain and carrying out snore prediction on the N sequence, and forming an M2 sequence by means of an N sequence that is predicted to be qualified; and carrying out a Fourier transform on the M2 sequence to transform same into the frequency domain and matching pre-set snore parameters according to the spectral peak features of the M2 sequence, and then outputting a monitored snore signal.

Description

一种鼾声识别方法及止鼾装置Beep recognition method and anti-snoring device 技术领域Technical field
本申请涉及健康监测设备,特别是涉及一种鼾声识别方法及止鼾装置。The present application relates to a health monitoring device, and more particularly to a snoring recognition method and a snoring device.
背景技术Background technique
正常人睡眠时,会因喉咙附近的软组织松弛产生变形,从而导致上呼吸道狭窄,阻碍呼吸的顺畅性,也就是常见的打鼾。When a normal person sleeps, it will be deformed due to the relaxation of the soft tissue near the throat, which leads to the narrowing of the upper airway and hinders the smoothness of the breathing, which is a common snoring.
如果打鼾成为睡眠习惯,影响呼吸的打鼾日积月累会潜藏健康隐患。比如阻塞型睡眠呼吸暂停(ObstructiveApnea,OSA),喉咙附近的软组织松弛而造成上呼吸道阻塞,呼吸道收窄引致睡眠时呼吸暂停。或者OSAHS,阻塞性睡眠呼吸暂停低通气综合征(obstructive sleep apnea-hypopnea syndrome)。OSAHS是指睡时上气道塌陷阻塞引起的呼吸暂停和通气不足、伴有打鼾、睡眠结构紊乱、频繁发生血氧饱和度下降、白天嗜睡等病征。呼吸暂停是指睡眠过程中口鼻气流停止≥10s.低通气是指睡眠过程中呼吸气流强度较基础水平降低超过30%以上,并伴血氧饱和度(SaO)3%或伴有觉醒。睡眠呼吸暂停综合症,在连续7h睡眠中发生30次以上的呼吸暂停,每次气流中止10s以上(含10s),或平均每小时睡眠呼吸暂停 低通气次数(呼吸紊乱指数)超过5次,而引起慢性低氧血症及高碳酸血症的临床综合征.可分为中枢型、阻塞型及混合型。If snoring becomes a sleep habit, the snoring that affects breathing will accumulate health risks. For example, obstructive sleep apnea (OSA), soft tissue relaxation near the throat causes upper airway obstruction, and narrowing of the airway leads to apnea during sleep. Or OSAHS, obstructive sleep apnea hypopnea syndrome (obstructive sleep Apnea-hypopnea syndrome). OSAHS refers to apnea and hypopnea caused by airway collapse obstruction during sleep, accompanied by snoring, sleep structure disorder, frequent occurrence of oxygen saturation, and daytime sleepiness. Apnea refers to the cessation of nasal and nasal airflow during sleep ≥10s. Hypnea refers to the reduction of respiratory airflow during sleep more than 30% above the baseline level, with 3% oxygen saturation (SaO) or arousal. Sleep apnea syndrome, more than 30 apneas occurred during 7 hours of sleep, each time the airflow is stopped for more than 10s (including 10s), or the average number of sleep apnea hypopneas (respiratory disorder index) exceeds 5 times. Clinical syndrome that causes chronic hypoxemia and hypercapnia. Can be divided into central type, blocking type and mixed type.
打鼾是睡眠呼吸暂停综合症(阻塞性睡眠呼吸暂停)的初期阶段,二者属于同一疾病的不同时期。如果打鼾不进行及时的干预,数十年后,将会发展为睡眠呼吸暂停综合症。Snoring is the early stage of sleep apnea syndrome (obstructive sleep apnea), which belongs to different periods of the same disease. If you do not promptly intervene, you will develop sleep apnea syndrome in a few decades.
现有的止鼾设备比如止鼾垫,包括了控制器、鼾声识别模块、判断模块和提醒机构。鼾声识别模块在用户睡眠时实时监测打鼾状况,采集环境音频并识别出鼾声。识别出鼾声后向控制器发送打鼾信息,控制器的判断模块在确定采集识别的是打鼾时,驱动提醒机构运动,触动用户身体使用户停止打鼾。Existing anti-snagging devices, such as anti-snag pads, include a controller, a click recognition module, a judging module, and a reminder mechanism. The humming recognition module monitors the snoring condition in real time while the user is asleep, collects ambient audio and recognizes snoring. After the buzz is recognized, the snoring information is sent to the controller, and the judging module of the controller drives the reminding mechanism to move when the snoring is determined to be snoring, and touches the user's body to stop the snoring.
但是现有技术中鼾声识别一般采用隐马尔科夫模型(HMM)语音识别的方式来进行鼾声识别。模型较复杂,计算量大,需要强大的计算芯片,不适合止鼾设备小型化/便携化的发展趋势。However, the prior art snoring recognition generally uses Hidden Markov Model (HMM) speech recognition to perform snoring recognition. The model is more complex, computationally intensive, requires a powerful computing chip, and is not suitable for the development trend of miniaturization/portability of anti-snagging equipment.
因此,现有技术亟待改进以解决其出现的技术问题。Therefore, the prior art needs to be improved to solve the technical problems that arise.
发明内容Summary of the invention
本申请提供一种鼾声识别方法及止鼾装置,鼾声识别准确,止鼾装置计算量大大减少,能够适应止鼾设备小型化的需求,包括止鼾手环,止鼾耳机以及止鼾枕等等。The application provides a snoring recognition method and a snoring device, and the snoring recognition is accurate, and the calculation amount of the snoring device is greatly reduced, and can adapt to the demand for miniaturization of the stagnation device, including the shackle bracelet, the snoring earphone, the snoring pillow, and the like. .
第一方面,本申请实施例提供了一种鼾声识别方法,包括:In a first aspect, the embodiment of the present application provides a method for identifying a click, including:
鼾声音频信号采样;Beep audio signal sampling;
对采样信号进行模数(ADC)转换;Performing analog-to-digital (ADC) conversion on the sampled signal;
基于时域和该采样信号形成音频帧,对音频帧移帧进行鼾声分析;Forming an audio frame based on the time domain and the sampled signal, and performing a click analysis on the audio frame shift frame;
窗函数分帧,将分帧得到的M1序列傅里叶变换至频域,并对频域频谱的低频能量求和;The window function is framed, and the M1 sequence obtained by the frame is Fourier transformed into the frequency domain, and the low frequency energy of the frequency domain spectrum is summed;
低频能量和值在时域上组成N序列,并对该N序列做鼾声预判,预判合格的N序列形成M2序列;The low-frequency energy and value form an N-sequence in the time domain, and the N-sequence is pre-judged, and the qualified N-sequence forms a M2 sequence;
对该M2序列傅里叶变换至频域,并根据该M2序列的谱峰特征匹配预设的鼾声参数,输出监测鼾声信号。The M2 sequence is Fourier transformed to the frequency domain, and the preset chirp parameters are matched according to the spectral peak characteristics of the M2 sequence, and the monitoring chirp signal is output.
其中,该鼾声识别方法还包括:音频帧以1/X速率先进先出移帧, X为任意数;其中,该方法还包括音频信号预加重处理。The method for identifying the click sound further includes: the audio frame is moved by the first-in first-out frame at a 1/X rate, and X is an arbitrary number; wherein the method further includes an audio signal pre-emphasis process.
优选的,该对N序列做鼾声预判包括:计算该N序列数据的最大最小值以及最大差值; 其中,该鼾声预判的参数包括过零率、波形幅值以及过零间隙。Preferably, the beating pre-judging of the pair of N sequences comprises: calculating a maximum minimum value and a maximum difference value of the N-sequence data; wherein the parameter predicted by the click sound comprises a zero-crossing rate, a waveform amplitude, and a zero-crossing gap.
为了提高鼾声识别精准度,该方法还包括:对该M2序列进行归一化处理、截短平均处理。In order to improve the accuracy of the click recognition, the method further comprises: normalizing the M2 sequence and truncating the average processing.
具体的,该窗函数为汉宁窗函数(Hanning)或者汉明窗函数(Hamming)或者布莱克曼窗函数(Blackman)。Specifically, the window function is a Hanning window function (Haning) or a Hamming window function (Hamming) or a Blackman window function (Blackman).
第二方面,本申请实施例还提供了一种止鼾装置,包括微控制器、模数转换电路(ADC)和声音拾取电路,该声音拾取电路用于采样鼾声音频信号,该模数转换电路用于对该采样信号进行模数(ADC)转换,还包括:鼾声识别模块,该鼾声识别模块用于基于时域和该采样信号形成音频帧,对音频帧移帧进行鼾声分析;用于窗函数分帧,将分帧得到的M1序列傅里叶变换至频域,并对频域频谱的低频能量求和;用于低频能量和值在时域上组成N序列,并对频域N序列做鼾声预判,预判合格的N序列形成M2序列;还用于对M2序列傅里叶变换至频域,并根据M2序列的谱峰特征匹配预设的鼾声参数,输出监测鼾声信号;以及止鼾机构,其中,该微控制器收到该监测鼾声信号后,启动止鼾机构触压受监测的用户。In a second aspect, the embodiment of the present application further provides a stagnation device, including a microcontroller, an analog-to-digital conversion circuit (ADC), and a sound pickup circuit, the sound pickup circuit is configured to sample a click audio signal, and the analog-to-digital conversion circuit And performing analog-to-digital (ADC) conversion on the sampling signal, further comprising: a click recognition module, configured to form an audio frame based on the time domain and the sampled signal, perform a click analysis on the audio frame shift frame; Function framing, transforming the M1 sequence obtained by the framing into the frequency domain, and summing the low-frequency energy of the frequency domain spectrum; for the low-frequency energy and value to form the N-sequence in the time domain, and the frequency-domain N-sequence By pre-judging, the pre-determined N sequence forms an M2 sequence; it is also used to perform Fourier transform on the M2 sequence to the frequency domain, and matches the preset chirp parameters according to the spectral peak characteristics of the M2 sequence, and outputs a monitoring chirp signal; The stagnation mechanism, wherein the microcontroller receives the monitoring squeak signal, and then activates the stagnation mechanism to touch the monitored user.
其中,该鼾声识别模块还用于对音频帧以1/X速率先进先出移帧,其中,X为任意数;该鼾声识别模块还用于对音频信号预加重处理。The click recognition module is further configured to move the frame to the audio frame at a 1/X rate, where X is an arbitrary number; the click recognition module is further configured to pre-emphasize the audio signal.
优选的,该鼾声识别模块还用于在鼾声预判时:计算该N序列数据的最大最小值以及最大差值; 其中,该鼾声预判的参数包括过零率、波形幅值以及过零间隙。Preferably, the click recognition module is further configured to: when the sound is predicted, calculate a maximum minimum value and a maximum difference value of the N sequence data; wherein the parameters of the click sound prediction include a zero crossing rate, a waveform amplitude, and a zero crossing gap. .
为了提高鼾声识别精准度,该鼾声识别模块还用于对该M2序列进行归一化处理、截短平均处理。In order to improve the accuracy of the click recognition, the click recognition module is further configured to perform normalization processing and truncation averaging processing on the M2 sequence.
具体的,该窗函数为汉宁窗函数(Hanning)或者汉明窗函数(Hamming)或者布莱克曼窗函数(Blackman)。Specifically, the window function is a Hanning window function (Haning) or a Hamming window function (Hamming) or a Blackman window function (Blackman).
第三方面,本申请实施例还提供了一种电子设备,包括:In a third aspect, the embodiment of the present application further provides an electronic device, including:
至少一个处理器;以及,At least one processor; and,
与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein
所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行如上所述的方法。The memory stores instructions executable by the at least one processor, the instructions being executed by the at least one processor to enable the at least one processor to perform the method as described above.
第四方面,本申请实施例还提供了一种非易失性计算机可读存储介质,所述计算机可读存储介质存储有计算机可执行指令,所述计算机可执行指令用于使计算机执行如上所述的方法。In a fourth aspect, the embodiment of the present application further provides a non-transitory computer readable storage medium, where the computer readable storage medium stores computer executable instructions for causing a computer to execute the above The method described.
第五方面,本申请实施例还提供了一种计算机程序产品,所述计算机程序产品包括存储在非易失性计算机可读存储介质上的计算机程序,所述计算机程序包括程序指令,当所述程序指令被计算机执行时,使所述计算机执行如上所述的方法。In a fifth aspect, the embodiment of the present application further provides a computer program product, where the computer program product includes a computer program stored on a non-transitory computer readable storage medium, the computer program includes program instructions, when When the program instructions are executed by the computer, the computer is caused to perform the method as described above.
本申请实施例提供的鼾声识别方法和装置,将窗函数分帧后的鼾声识别分为能量区分阶段和鼾声特征比对阶段,在安静睡眠环境下,识别模型精简,计算量大大减少,可实现止鼾产品的小型化和便携式设计;同时,在安静睡眠环境下,对60分贝以上的鼾声,可实现100%准确的鼾声监测。The method and device for detecting the click sound provided by the embodiment of the present application divides the click recognition after the window function is divided into the energy differentiation stage and the click characteristic comparison stage. In the quiet sleep environment, the recognition model is simplified, and the calculation amount is greatly reduced, which can be realized. Miniaturization and portable design of anti-snoring products; at the same time, in a quiet sleep environment, 100% accurate beep monitoring can be achieved for buzzing above 60 decibels.
附图说明DRAWINGS
一个或多个实施例通过与之对应的附图中的图片进行示例性说明,这些示例性说明并不构成对实施例的限定,附图中具有相同参考数字标号的元件表示为类似的元件,除非有特别申明,附图中的图不构成比例限制。The one or more embodiments are exemplified by the accompanying drawings in the accompanying drawings, and FIG. The figures in the drawings do not constitute a scale limitation unless otherwise stated.
图1是本申请实施例提供的一种鼾声识别方法实施例的流程图;1 is a flowchart of an embodiment of a method for identifying a click sound provided by an embodiment of the present application;
图2是本申请实施例提供的止鼾装置的产品模块图;2 is a product module diagram of a stagnation device provided by an embodiment of the present application;
图3是本申请实施例提供的鼾声识别方法的带通滤波的作用示意图;3 is a schematic diagram of the action of band pass filtering of the click recognition method provided by the embodiment of the present application;
图4是本申请实施例提供的鼾声识别方法的预加重处理的频率能量关系图;4 is a frequency energy relationship diagram of pre-emphasis processing of the click recognition method provided by the embodiment of the present application;
图5是本申请实施例提供的鼾声识别方法的总流程图;FIG. 5 is a general flowchart of a method for identifying a click sound provided by an embodiment of the present application; FIG.
图6是本申请实施例提供的鼾声识别方法中M1序列的能量求和原理图;6 is a schematic diagram of the energy summation of the M1 sequence in the click recognition method provided by the embodiment of the present application;
图7是本申请实施例提供的鼾声识别方法中M2序列的鼾声识别示意图;7 is a schematic diagram of a click recognition of an M2 sequence in a click recognition method provided by an embodiment of the present application;
图8是本申请实施例提供的鼾声识别方法的鼾声特征比对流程图;以及8 is a flowchart of a click characteristic comparison of the click recognition method provided by the embodiment of the present application;
图9是本申请实施例提供的执行鼾声识别方法的电子设备的硬件结构示意图。FIG. 9 is a schematic diagram showing the hardware structure of an electronic device for performing a click recognition method according to an embodiment of the present application.
具体实施方式Detailed ways
下面结合附图和实施方式对本实施例进行详细说明。The embodiment will be described in detail below with reference to the accompanying drawings and embodiments.
本申请涉及一种鼾声识别方法以及使用该方法的止鼾装置。该鼾声识别方法和装置将鼾声识别分为能量累计阶段和鼾声特征比对阶段,鼾声识别模型精简,计算量大大减少,可实现止鼾产品的小型化和便携式设计;同时,在安静睡眠环境下,对60分贝以上的鼾声,可实现100%准确的鼾声监测。The present application relates to a click recognition method and a stop device using the same. The snoring recognition method and device divides the snoring recognition into an energy accumulation phase and a snoring feature comparison phase, and the snoring recognition model is simplified, the calculation amount is greatly reduced, and the miniaturization and portable design of the stagnation product can be realized; meanwhile, in a quiet sleep environment For a beep of more than 60 decibels, 100% accurate bark monitoring can be achieved.
该能量累计阶段和鼾声特征比对阶段均由鼾声识别模块完成。The energy accumulation phase and the click characteristic comparison phase are all completed by the click recognition module.
请参考图2,所示为止鼾装置的产品模块图,本实施例的止鼾装置包括微控制器10、模数转换电路(ADC)30、鼾声识别模块40、声音拾取电路20以及止鼾机构50。其中,ADC代表模拟数字转换器(Analog to Digital Converter)。Referring to FIG. 2, a product module diagram of the device is shown. The stagnation device of the present embodiment includes a microcontroller 10, an analog-to-digital conversion circuit (ADC) 30, a click recognition module 40, a sound pickup circuit 20, and a stop mechanism. 50. Among them, ADC stands for Analog to Digital Converter.
该微控制器10控制整个止鼾装置的工作。该声音拾取电路20采样鼾声音频信号,本实施例中,该声音拾取电路20通过麦克风及其相关电路获取鼾声的波形信号(模拟信号)。该模数转换电路对该采样信号进行模数(ADC)转换。The microcontroller 10 controls the operation of the entire stagnation device. The sound pickup circuit 20 samples the click audio signal. In the present embodiment, the sound pickup circuit 20 acquires a waveform signal (analog signal) of the click sound through the microphone and its associated circuit. The analog to digital conversion circuit performs analog to digital (ADC) conversion on the sampled signal.
该止鼾机构50可以有多种实现方式,比如在睡眠监测手环实施例下,该止鼾机构应设计得小巧,唤醒力量适中。在睡眠监测床垫实施例中,该止鼾机构的尺寸和触压唤醒力量相对设计大一些。该与微控制器10连接可包括马达、齿轮组和触压结构,该止鼾装置在微控制器10信令指挥下由马达还有齿轮组传送动力,通过触压结构对被检测的睡眠用户施行唤醒动作。The anti-snoring mechanism 50 can be implemented in various ways. For example, in the sleep monitoring wristband embodiment, the anti-snoring mechanism should be designed to be small and the wake-up power is moderate. In the sleep monitoring mattress embodiment, the size and the pressure awakening force of the stagnation mechanism are relatively larger than the design. The connection to the microcontroller 10 can include a motor, a gear set, and a touch-pressure structure that transmits power from the motor and the gear set under the command of the microcontroller 10, through the touch-pressure structure to the detected sleep user. Perform a wake-up action.
该鼾声识别模块基于时域和该采样信号形成音频帧。该鼾声识别模块对音频帧移帧进行鼾声分析。该鼾声识别模块还对音频帧以1/X(其中,X为任意数)速率先进先出移帧。为了清楚说明技术方案,本实施例中,该X取值2。The click recognition module forms an audio frame based on the time domain and the sampled signal. The click recognition module performs a click analysis on the audio frame shift frame. The click recognition module also advances the first-in-first-out frame of the audio frame at a rate of 1/X (where X is an arbitrary number). In order to clarify the technical solution, in the embodiment, the value of X is 2.
该鼾声识别模块选取窗函数分帧,再将分帧得到的M1序列傅里叶变换至频域,并对频域频谱的低频能量求和,以完成鼾声识别的低频能量累计阶段。The click recognition module selects the window function to divide the frame, and then transforms the M1 sequence obtained by the frame into the frequency domain, and sums the low frequency energy of the frequency domain spectrum to complete the low frequency energy accumulation phase of the click recognition.
然后,该鼾声识别模块将低频能量和值在时域上组成N序列,接着完成对N序列的鼾声预判,预判合格的后N序列形成M2序列。该鼾声识别模块再对M2序列傅里叶变换至频域,并根据M2序列的谱峰特征匹配预设的鼾声参数,输出睡眠用户是否打鼾的监测鼾声信号。Then, the click recognition module combines the low frequency energy and the value into an N sequence in the time domain, and then completes the click prediction of the N sequence, and predicts that the qualified back N sequence forms the M2 sequence. The click recognition module further performs Fourier transform on the M2 sequence to the frequency domain, and matches the preset click parameters according to the spectral peak characteristics of the M2 sequence, and outputs a monitoring click signal of whether the sleep user is snoring.
该微控制器10收到该监测鼾声信号后,启动止鼾机构50触压受监测的用户,唤醒睡眠用户停止打鼾。After receiving the monitoring click signal, the microcontroller 10 activates the stop mechanism 50 to touch the monitored user and wake up the sleep user to stop snoring.
本申请实施例的鼾声识别模块对音频信号处理简单快捷,只有带通滤波计算,加窗计算,快速傅里叶变换(FFT)占用相对较多计算资源,但上述计算在普通MCU中均可实现。The click recognition module of the embodiment of the present application is simple and fast to process audio signals, and only band pass filter calculation, windowing calculation, and fast Fourier transform (FFT) occupy relatively more computing resources, but the above calculation can be implemented in a common MCU. .
该鼾声识别模块对音频帧以1/X速率移帧后,开始进行鼾声分析。为了清楚说明技术方案,本实施例中,该X取值2。也就是,数据M1序列1/2移帧,假设M1序列的长度为m,也就是将长度为m的M1序列进行m/2长度的顺序移位。The click recognition module starts the click analysis after the frame is moved at the 1/X rate. In order to clarify the technical solution, in the embodiment, the value of X is 2. That is, the data M1 sequence is 1/2 frame shifted, assuming that the length of the M1 sequence is m, that is, the M1 sequence of length m is sequentially shifted by m/2 length.
其中,该微控制器10通过模数转换电路30(ADC),将该声音拾取电路20获取的声音波形信号(模拟信号)转为数字信号再由该鼾声识别模块40进行鼾声识别计算和处理,并输出判别结果给微控制器10。The microcontroller 10 converts the sound waveform signal (analog signal) acquired by the sound pickup circuit 20 into a digital signal through an analog-to-digital conversion circuit 30 (ADC), and then performs the click recognition calculation and processing by the click recognition module 40. The discrimination result is output to the microcontroller 10.
该鼾声识别模块40在鼾声预判时:为了便于找到高频,本实施例中在预判时该鼾声识别模块40计算该N序列数据的最大最小值以及最大差值; 另外,该鼾声预判步骤中使用的参数包括过零率、波形幅值以及过零间隙,剔除初步判断非鼾声的信号。The click recognition module 40 calculates the maximum and minimum values of the N-series data during the pre-judgment in order to facilitate the high-frequency determination. In addition, the click prediction is performed. The parameters used in the steps include the zero-crossing rate, the amplitude of the waveform, and the zero-crossing gap, eliminating the signal that is initially judged to be non-squeaky.
在鼾声特征比对阶段,为了便于模块判断和计算,该鼾声识别模块40还用于完成该M2序列进行归一化处理、截短平均处理。In the click-to-speech feature comparison stage, in order to facilitate module judgment and calculation, the click recognition module 40 is further configured to perform normalization processing and truncation averaging processing on the M2 sequence.
本申请使用的窗函数可以为汉宁窗函数(Hanning)或者汉明窗函数(Hamming)或者布莱克曼窗函数(Blackman)。The window function used in this application may be a Hanning window function or a Hamming window function or a Blackman window function (Blackman).
为了具体加以说明,本申请实施例中,分帧信号采用汉宁窗(Hanning窗)处理以获得短时音频M1序列,该短时音频M1序列的的长度为m,按元素m乘以Hanning窗滤波器的系数。该窗函数可考虑用其他窗函数替换。只需满足快速傅里叶变换(FFT)计算时,不会存在频谱泄露即可。For specific description, in the embodiment of the present application, the framing signal is processed by a Hanning window to obtain a short-time audio M1 sequence, the length of the short-time audio M1 sequence is m, multiplied by the element m by the Hanning window. The coefficient of the filter. This window function can be considered to be replaced with other window functions. When there is only a Fast Fourier Transform (FFT) calculation, there is no spectrum leakage.
请一并参考图1和图5,所示为本申请实施例鼾声识别方法实施例的主要流程图和详细流程图。Referring to FIG. 1 and FIG. 5 together, a main flowchart and a detailed flowchart of an embodiment of a method for identifying a click sound according to an embodiment of the present application are shown.
该鼾声识别方法,为保证识别准确率,将鼾声识别分为能量区分阶段100和鼾声特征比对阶段300,其中在能量区分阶段100和鼾声特征比对阶段300中间设置鼾声预判步骤200。能量区分阶段100和鼾声特征比对阶段300均在频域进行,具体方法介绍如下:In order to ensure the recognition accuracy, the click recognition method is divided into an energy discrimination phase 100 and a click feature comparison phase 300, wherein a click prediction step 200 is set in the middle of the energy discrimination phase 100 and the click feature comparison phase 300. Both the energy discrimination phase 100 and the click feature comparison phase 300 are performed in the frequency domain. The specific method is as follows:
步骤S101:鼾声音频信号采样,本实施例中,鼾声信号采样采用硬件麦克风以及相关电路;Step S101: sampling the chirp audio signal. In this embodiment, the chirp signal sampling uses a hardware microphone and a related circuit;
步骤S103:对采样信号进行模数(ADC)转换,止鼾装置设置模数转换电路30对该采样信号进行模数(ADC)转换;Step S103: Perform analog-to-digital (ADC) conversion on the sampling signal, and set the analog-to-digital conversion circuit 30 to perform analog-to-digital (ADC) conversion on the sampling signal;
步骤S106:基于时域和该采样信号形成音频帧,对音频帧移帧,开始鼾声分析的能量区分阶段100。Step S106: forming an audio frame based on the time domain and the sampling signal, shifting the frame of the audio frame, and starting the energy discrimination phase 100 of the click analysis.
首先窗函数分帧,其中,通过对音频帧的移帧并采用窗函数获取短时音频数据M1序列,分帧信号采用汉宁窗处理。该短时音频M1序列的的长度为m,按元素m乘以Hanning窗滤波器的系数。该窗函数可用其他窗函数替换,需满足快速傅里叶变换(FFT)计算时不会存在频谱泄露。First, the window function is framed, wherein the frame of the short-time audio data M1 is obtained by moving the frame of the audio frame and adopting a window function, and the framed signal is processed by Hanning window. The length of the short-term audio M1 sequence is m, multiplied by the element m by the coefficient of the Hanning window filter. This window function can be replaced by other window functions, and there is no spectrum leakage when fast Fourier transform (FFT) calculation is required.
形成音频帧后,该鼾声识别方法包括对采样音频帧进行带通滤波处理。对如图3所示鼾声识别方法的带通滤波的作用示意图,该带通滤波对音频信号数字带通滤波的处理是将鼾声语音频率成分的信号保留,而将直流信号低频信号A(非语音频率范围)和高频信号B(非语音频率范围)滤除。After the audio frame is formed, the click recognition method includes performing band pass filtering on the sampled audio frame. A schematic diagram of the action of the bandpass filtering of the click recognition method shown in FIG. 3, the bandpass filtering is performed on the digital bandpass filtering of the audio signal to preserve the signal of the chirp voice frequency component, and the DC signal low frequency signal A (non-speech) Frequency range) and high frequency signal B (non-speech frequency range) are filtered out.
该步骤S106中的移帧是针对音频帧,该音频帧以1/X速率先进先出移帧;该短时音频M1序列可以1/2移帧,也可以采用1/x移帧(其中x为任意数),需要满足数据是不断进行先进先出移动操作。The frame shift in the step S106 is for an audio frame, and the audio frame is FIFO framed at a 1/X rate; the short-time audio M1 sequence can be 1/2 framed or 1/x framed (where x For any number), the data needs to be met continuously for the first-in, first-out move operation.
其中,针对该短时音频M1序列,该方法还包括对序列的音频信号预加重处理。如图4所示曲线C,为了提升频谱在高频部分的幅度,该鼾声识别模块还用于对音频信号预加重处理。其中,该音频信号的预加重处理采用一个数字高通滤波器,用于将语音信号的高频部分信号幅度加以提升,使频谱变平坦。Wherein, for the short-term audio M1 sequence, the method further comprises pre-emphasizing the audio signal of the sequence. As shown in curve C of Figure 4, in order to increase the amplitude of the spectrum in the high frequency portion, the click recognition module is also used to pre-emphasize the audio signal. The pre-emphasis processing of the audio signal uses a digital high-pass filter for boosting the amplitude of the high-frequency portion of the speech signal to flatten the spectrum.
步骤S108:将分帧后经过预加重处理的M1序列傅里叶变换至频域,并对频域频谱的低频能量求和;请参考图6所示M1序列的能量求和原理图,鼾声频谱能量主要集中于低频部分,通过获取短时低频信号能量组成长时能量信号序列。对长时能量序列进行频谱分析,得到长时音频是否为鼾声的判断。因此此处对频谱的低频部分能量求和,T代表低频能量之和。Step S108: Fourier transforming the M1 sequence subjected to pre-emphasis processing to the frequency domain, and summing the low-frequency energy of the frequency domain spectrum; refer to the energy summation diagram of the M1 sequence shown in FIG. The energy is mainly concentrated in the low frequency part, and the long-term energy signal sequence is composed by acquiring the short-time low-frequency signal energy. Perform a spectrum analysis on the long-term energy sequence to determine whether the long-term audio is a click. Therefore, the low-frequency partial energy of the spectrum is summed here, and T represents the sum of the low-frequency energy.
如图6所示,该M1序列做快速傅里叶变换(FFT),将短时音频波形序列(m个点的序列)做m点的FFT变换,得到短时音频的频谱分布结果。其中,计算低频能量求和值T是将短时音频M1序列的频谱分布的低频能量求和。As shown in FIG. 6, the M1 sequence is subjected to Fast Fourier Transform (FFT), and a short-time audio waveform sequence (a sequence of m points) is subjected to FFT conversion of m points to obtain a spectrum distribution result of short-time audio. Wherein, calculating the low frequency energy sum value T is summing the low frequency energy of the spectrum distribution of the short time audio M1 sequence.
步骤S110:低频能量和值T在时域上组成N序列,具体为计算求得的低频能量T和值数据存入N长度的N序列中,以得到音频信息的一个长时N序列。在鼾声特征比对阶段之前,对该N序列做鼾声预判200,预判合格的N序列形成进入鼾声特征比对阶段做最终鼾声判断的M2序列。Step S110: The low frequency energy and the value T form an N sequence in the time domain. Specifically, the calculated low frequency energy T and the value data are stored in the N sequence of the N length to obtain a long time N sequence of the audio information. Before the click characteristic phase, the N sequence is pre-determined 200, and the qualified N sequence forms an M2 sequence that enters the click characteristic phase to make the final click judgment.
请一并参考图5,该鼾声预判步骤200包括:Referring to FIG. 5 together, the click prediction step 200 includes:
对该长时N序列滤除直流处理,以去除明显非鼾声的能量信号, 其中,该N序列滤除直流处理用于将该N序列数据中的直流分量滤除,以得到一个交流序列。The long-time N-sequence is filtered to remove DC processing to remove the apparently non-squeaky energy signal, wherein the N-sequence filtering DC processing is used to filter the DC component in the N-sequence data to obtain an alternating sequence.
为了便于找到高频,对长时N序列数据最大最小值计算以及最大差值计算,其中,N序列最大、最小值计算以及最大差值计算可以采取的方式是先得到该N序列的最大值,最小值再计算最大最小值的差值;In order to find the high frequency, the maximum and minimum calculation of the long-time N-sequence data and the maximum difference calculation, wherein the N-sequence maximum, minimum calculation, and maximum difference calculation can be performed by first obtaining the maximum value of the N-sequence. The minimum value is then calculated as the difference between the maximum and minimum values;
另外,鼾声预判还包括剔除不规律音频数据,汽车鸣笛等高能杂音等,其剔除预判参数包括过零率、波形幅值以及过零间隙。In addition, the squeak pre-judgment also includes the elimination of irregular audio data, high-noise noise such as car whistle, etc., and the culling pre-judgment parameters include zero-crossing rate, waveform amplitude, and zero-crossing gap.
步骤S112:预判合格的N序列形成M2序列,进入鼾声特征比对阶段300。Step S112: Predetermine the qualified N sequence to form the M2 sequence, and enter the click feature comparison phase 300.
在优选方案中,在鼾声特征比对阶段300,为了使识别计算更精确得到硬件的支持,该鼾声识别方法还包括:对该M2序列进行归一化处理、截短平均处理。其中,N序列归一化处理,截短平均处理,通过最大值,最小值,将序列N的元素幅度值归一化到统一的数据值大小区间。并且将数据进行短序列n的平均处理(n<N/4)。记归一化并平均处理后的序列为M2序列。归一化处理,截短平均处理,对识别计算结果影响不大,也可以不设置该归一化处理、截短平均处理步骤。其中,归一化处理、截短平均处理的窗函数也采取同前述的汉宁窗函数(Hanning)。In a preferred embodiment, in the click feature comparison stage 300, in order to make the recognition calculation more accurate to obtain hardware support, the click recognition method further includes: performing normalization processing and truncation averaging processing on the M2 sequence. Among them, the N-sequence normalization process, the truncated averaging process, normalizes the element amplitude values of the sequence N to a uniform data value size interval by the maximum value and the minimum value. And the data is subjected to averaging processing of the short sequence n (n < N / 4). The normalized and averaged sequence is the M2 sequence. The normalization process and the truncated averaging process have little effect on the recognition calculation result, and the normalization process and the truncation averaging process step may not be set. Among them, the window function of the normalization processing and the truncated averaging processing also adopts the Hanning window function (Hanning) as described above.
步骤S114:对M2序列傅里叶变换至频域,并根据M2序列的谱峰特征匹配预设的鼾声参数,输出监测鼾声信号给止鼾装置。Step S114: Fourier transforming the M2 sequence to the frequency domain, and matching the preset chirp parameter according to the spectral peak characteristic of the M2 sequence, and outputting the monitoring chirp signal to the anti-snoring device.
具体请参考图7,M1序列做快速傅里叶变换(FFT),基于M1序列或者N序列(设置预判步骤时)做M2点的快速傅里叶变换,得到长时音频的频谱分布结果。在频谱分析处理中,记录频谱分布中最大的三个谱峰(P1、P2、P3)信息,如三个谱峰(P1、P2、P3)均满足判定条件,则判定为该长时音频为鼾声,并输出鼾声信号;否则识别为非鼾声,再一并输出非鼾声信号。For details, please refer to FIG. 7. The M1 sequence is subjected to Fast Fourier Transform (FFT), and the M1 point fast Fourier transform is performed based on the M1 sequence or the N sequence (when the pre-judging step is set), and the spectrum distribution result of the long-term audio is obtained. In the spectrum analysis process, the information of the three largest spectral peaks (P1, P2, P3) in the spectral distribution is recorded. If the three spectral peaks (P1, P2, P3) satisfy the determination condition, it is determined that the long-term audio is Beep, and output a beep signal; otherwise, it is recognized as a non-beep, and then a non-beep signal is output.
请参考图8,所示为鼾声识别方法的鼾声特征比对流程图。该鼾声最后筛选根据鼾声的特点判断方法不同,在本实施例中,从M2序列找到的频谱峰值最大的3个点(P1、P2、P3)的后续判断流程如下:Please refer to FIG. 8 , which is a flow chart of the click characteristic comparison of the click recognition method. The final screening of the click is different according to the characteristics of the click. In this embodiment, the subsequent judgment process of the three points (P1, P2, P3) with the largest peak of the spectrum found from the M2 sequence is as follows:
获取频谱能量最大的三个频点(P1、P2、P3);Obtain three frequency points (P1, P2, P3) with the largest spectrum energy;
计算最大频谱能量与第二频谱能量的倍数;Calculating a multiple of the maximum spectral energy and the second spectral energy;
倍数大于2的符合鼾声特征,进一步判断大于倍数2的频点是否符合呼吸率,符合呼吸率时判别为鼾声;不符合呼吸率时判别为非鼾声;If the multiple is greater than 2, the squeaking characteristic is further determined, and it is further determined whether the frequency greater than the multiple 2 meets the breathing rate, and the squeak is determined when the breathing rate is met; when the breathing rate is not met, the sound is determined to be non-squeaking;
如果单个频点倍数小于2则不符合鼾声特征,可进一步判断三个频点是否都对应满足呼吸率,符合呼吸率时判别为鼾声;不符合呼吸率时判别为非鼾声。If the single frequency point multiple is less than 2, it does not meet the snoring characteristics, and it can be further determined whether the three frequency points all correspond to the breathing rate, and the squeak is determined when the breathing rate is met; when the breathing rate is not met, the sound is determined to be non-squeaking.
以上实施例是以最大三个谱峰来判断,也可以基于单个或者多个谱峰来具体进行判断规则的确定。The above embodiment is judged by the maximum three spectral peaks, and the determination rule can be specifically determined based on single or multiple spectral peaks.
本申请鼾声识别方法和止鼾装置,将鼾声识别分为能量累计阶段和鼾声特征比对阶段,鼾声识别模型精简,计算量大大减少,可实现止鼾产品的小型化和便携式设计同时保证较高的鼾声监测准确度;该鼾声识别方法和装置对信噪比较高的鼾声识别准确,鼾声频谱能量主要集中于低频部分,通过获取短时低频信号能量组成长时能量信号序列再对长时能量序列进行频谱分析,得到长时音频是否为鼾声的判断,鼾声识别精度高。The snoring recognition method and the snoring device of the present application divide the snoring recognition into an energy accumulation phase and a snoring feature comparison phase, and the snoring recognition model is simplified, the calculation amount is greatly reduced, and the miniaturization and portable design of the stagnation product can be realized while ensuring high The detection accuracy of the click sound; the method and device for detecting the click sound accurately recognizes the click sound with high signal to noise, and the spectrum energy of the click sound is mainly concentrated in the low frequency part, and the long-term energy signal sequence is composed by acquiring the short-time low-frequency signal energy and then the long-term energy is generated. The sequence is subjected to spectrum analysis to obtain whether the long-term audio is a click sound, and the click recognition accuracy is high.
图9是本申请实施例提供的鼾声识别方法的电子设备600的硬件结构示意图,如图9所示,该电子设备600包括:FIG. 9 is a schematic diagram of the hardware structure of the electronic device 600 according to the method for identifying the click sound provided by the embodiment of the present application. As shown in FIG. 9, the electronic device 600 includes:
一个或多个微控制器610以及存储器620,图9中以一个微控制器610为例。One or more of the microcontroller 610 and the memory 620, a microcontroller 610 is exemplified in FIG.
微控制器610和存储器620可以通过总线或者其他方式连接,图9中以通过总线连接为例。The microcontroller 610 and the memory 620 can be connected by a bus or other means, as exemplified by a bus connection in FIG.
存储器620作为一种非易失性计算机可读存储介质,可用于存储非易失性软件程序、非易失性计算机可执行程序以及模块,如本申请实施例中的鼾声识别方法对应的程序指令/模块(例如,附图2所示的鼾声识别模块40)。微控制器610通过运行存储在存储器620中的非易失性软件程序、指令以及模块,从而执行终端设备或者服务器的各种功能应用以及完成数据处理,即实现上述方法实施例的鼾声识别方法。The memory 620 is used as a non-volatile computer readable storage medium, and can be used for storing a non-volatile software program, a non-volatile computer-executable program, and a module, such as a program instruction corresponding to the click recognition method in the embodiment of the present application. / Module (for example, the click recognition module 40 shown in Figure 2). The microcontroller 610 performs various function applications of the terminal device or the server and completes data processing by executing non-volatile software programs, instructions, and modules stored in the memory 620, that is, implementing the click recognition method of the above method embodiments.
存储器620可以包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需要的应用程序;存储数据区可存储根据鼾声识别装置的使用所创建的数据等。此外,存储器620可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他非易失性固态存储器件。在一些实施例中,存储器620可选包括相对于微控制器610远程设置的存储器,这些远程存储器可以通过网络连接至鼾声识别装置。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 620 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created according to the use of the click recognition device, and the like. Moreover, memory 620 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some embodiments, memory 620 can optionally include memory remotely located relative to microcontroller 610, which can be connected to the click recognition device via a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
所述一个或者多个模块存储在所述存储器620中,当被所述一个或者多个微控制器610执行时,执行上述任意方法实施例中的鼾声识别方法,例如,执行以上描述的图1中的方法步骤101至步骤S114,图5中的方法步骤100至步骤300,实现图4中的模块31-34的功能,和实现图2中的鼾声识别模块40的功能。The one or more modules are stored in the memory 620, and when executed by the one or more microcontrollers 610, perform a click recognition method in any of the above method embodiments, for example, performing the above described FIG. The method steps 101 to S114 in FIG. 5, the method steps 100 to 300 in FIG. 5, implement the functions of the modules 31-34 in FIG. 4, and implement the functions of the click recognition module 40 in FIG.
上述产品可执行本申请实施例所提供的方法,具备执行方法相应的功能模块和有益效果。未在本实施例中详尽描述的技术细节,可参见本申请实施例所提供的方法。The above products can perform the methods provided by the embodiments of the present application, and have the corresponding functional modules and beneficial effects of the execution method. For technical details that are not described in detail in this embodiment, reference may be made to the method provided by the embodiments of the present application.
本申请实施例的电子设备以多种形式存在,包括但不限于:The electronic device of the embodiment of the present application exists in various forms, including but not limited to:
(1)移动通信设备:这类设备的特点是具备移动通信功能,并且以提供话音、数据通信为主要目标。这类终端包括:智能手机(例如iPhone)、多媒体手机、功能性手机,以及低端手机等。(1) Mobile communication devices: These devices are characterized by mobile communication functions and are mainly aimed at providing voice and data communication. Such terminals include: smart phones (such as iPhone), multimedia phones, functional phones, and low-end phones.
(2)超移动个人计算机设备:这类设备属于个人计算机的范畴,有计算和处理功能,一般也具备移动上网特性。这类终端包括:PDA、MID和UMPC设备等,例如iPad。(2) Ultra-mobile personal computer equipment: This type of equipment belongs to the category of personal computers, has computing and processing functions, and generally has mobile Internet access. Such terminals include: PDAs, MIDs, and UMPC devices, such as the iPad.
(3)服务器:提供计算服务的设备,服务器的构成包括微控制器、硬盘、内存、系统总线等,服务器和通用的计算机架构类似,但是由于需要提供高可靠的服务,因此在处理能力、稳定性、可靠性、安全性、可扩展性、可管理性等方面要求较高。(3) Server: A device that provides computing services. The server consists of a microcontroller, a hard disk, a memory, a system bus, etc. The server is similar to a general-purpose computer architecture, but because of the need to provide highly reliable services, it is capable of processing and stable. Sex, reliability, security, scalability, manageability, etc. are highly demanding.
(4)其他具有数据交互功能的电子装置。(4) Other electronic devices with data interaction functions.
本申请实施例提供了一种非易失性计算机可读存储介质,所述计算机可读存储介质存储有计算机可执行指令,该计算机可执行指令被一个或多个微控制器执行,例如图9中的一个微控制器610,可使得上述一个或多个微控制器可执行上述任意方法实施例中的鼾声识别方法,例如,执行以上描述的图1中的方法步骤101至步骤S114,图5中的方法步骤100至步骤300,实现图4中的模块31-34的功能,和实现图2中的鼾声识别模块40的功能。Embodiments of the present application provide a non-transitory computer readable storage medium storing computer-executable instructions that are executed by one or more microcontrollers, such as FIG. One of the microcontrollers 610 can cause the one or more microcontrollers to perform the click recognition method in any of the above method embodiments, for example, to perform the method steps 101 to S114 in FIG. 1 described above, FIG. In steps 100 through 300 of the method, the functions of the modules 31-34 in FIG. 4 are implemented, and the functions of the click recognition module 40 in FIG. 2 are implemented.
以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。The device embodiments described above are merely illustrative, wherein the units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, ie may be located A place, or it can be distributed to multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
通过以上的实施方式的描述,本领域普通技术人员可以清楚地了解到各实施方式可借助软件加通用硬件平台的方式来实现,当然也可以通过硬件。本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程是可以通过计算机程序来指令相关的硬件来完成,所述的程序可存储于一计算机可读取存储介质中,该程序在执行时,可包括如上述各方法的实施例的流程。其中,所述的存储介质可为磁碟、光盘、只读存储记忆体(Read-Only Memory, ROM)或随机存储记忆体(Random Access Memory, RAM)等。Through the description of the above embodiments, those skilled in the art can clearly understand that the various embodiments can be implemented by means of software plus a general hardware platform, and of course, by hardware. A person skilled in the art can understand that all or part of the process of implementing the above embodiments can be completed by a computer program to instruct related hardware, and the program can be stored in a computer readable storage medium. When executed, the flow of an embodiment of the methods as described above may be included. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (Random Access). Memory, RAM), etc.
最后应说明的是:以上实施例仅用以说明本申请的技术方案,而非对其限制;在本申请的思路下,以上实施例或者不同实施例中的技术特征之间也可以进行组合,步骤可以以任意顺序实现,并存在如上所述的本申请的不同方面的许多其它变化,为了简明,它们没有在细节中提供;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present application, and are not limited thereto; in the idea of the present application, the technical features in the above embodiments or different embodiments may also be combined. The steps may be carried out in any order, and there are many other variations of the various aspects of the present application as described above, which are not provided in the details for the sake of brevity; although the present application has been described in detail with reference to the foregoing embodiments, The skilled person should understand that the technical solutions described in the foregoing embodiments may be modified, or some of the technical features may be equivalently replaced; and the modifications or substitutions do not deviate from the embodiments of the present application. The scope of the technical solution.

Claims (13)

  1. 一种鼾声识别方法,包括: A method for identifying a click, comprising:
    鼾声音频信号采样;Beep audio signal sampling;
    对采样信号进行模数(ADC)转换;Performing analog-to-digital (ADC) conversion on the sampled signal;
    基于时域和所述采样信号形成音频帧,对音频帧移帧进行鼾声分析;Forming an audio frame based on the time domain and the sampled signal, and performing a click analysis on the audio frame shift frame;
    窗函数分帧,将分帧得到的M1序列傅里叶变换至频域,并对频域频谱的低频能量求和;The window function is framed, and the M1 sequence obtained by the frame is Fourier transformed into the frequency domain, and the low frequency energy of the frequency domain spectrum is summed;
    低频能量和值在时域上组成N序列,并对所述N序列做鼾声预判,预判合格的N序列形成M2序列;The low frequency energy and the value form an N sequence in the time domain, and the N sequence is predicted by the click, and the qualified N sequence is predicted to form the M2 sequence;
    对所述M2序列傅里叶变换至频域,并根据所述M2序列的谱峰特征匹配预设的鼾声参数,输出监测鼾声信号。The M2 sequence is Fourier transformed to the frequency domain, and the preset chirp parameters are matched according to the spectral peak characteristics of the M2 sequence, and the monitoring chirp signal is output.
  2. 根据权利要求1所述的方法,其中,所述方法还包括:The method of claim 1 wherein the method further comprises:
    音频帧以1/X速率先进先出移帧,其中,X为任意数;The audio frame is moved first and last out at a 1/X rate, where X is an arbitrary number;
    其中,所述方法还包括音频信号预加重处理。Wherein, the method further comprises an audio signal pre-emphasis process.
  3. 根据权利要求2所述的方法,其中,所述对N序列做鼾声预判包括:The method of claim 2 wherein said predicting the N sequence comprises:
    计算所述N序列数据的最大最小值以及最大差值; Calculating a maximum minimum value and a maximum difference value of the N sequence data;
    其中,所述鼾声预判的参数包括过零率、波形幅值以及过零间隙。The parameters predicted by the click include a zero-crossing rate, a waveform amplitude, and a zero-crossing gap.
  4. 根据权利要求1-3任一项所述的方法,其中,所述方法还包括: The method of any of claims 1-3, wherein the method further comprises:
    对所述M2序列进行归一化处理、截短平均处理。The M2 sequence is subjected to normalization processing and truncated averaging processing.
  5. 根据权利要求4所述的方法,其中,The method of claim 4, wherein
    所述窗函数为汉宁窗函数(Hanning)或者汉明窗函数(Hamming)或者布莱克曼窗函数(Blackman)。The window function is a Hanning window function (Haning) or a Hamming window function (Hamming) or a Blackman window function (Blackman).
  6. 一种止鼾装置,包括微控制器、模数转换电路(ADC)和声音拾取电路,所述声音拾取电路用于采样鼾声音频信号,所述模数转换电路用于对所述采样信号进行模数(ADC)转换,还包括:A stagnation device comprising a microcontroller, an analog to digital conversion circuit (ADC) and a sound pickup circuit, the sound pickup circuit for sampling a click audio signal, the analog to digital conversion circuit for modulating the sample signal Number (ADC) conversion, also includes:
    鼾声识别模块,所述鼾声识别模块用于基于时域和采样信号形成音频帧,对音频帧移帧进行鼾声分析;用于窗函数分帧,将分帧得到的M1序列傅里叶变换至频域,并对频域频谱的低频能量求和;用于低频能量和值在时域上组成N序列,并对频域N序列做鼾声预判,预判合格的N序列形成M2序列;还用于对M2序列傅里叶变换至频域,并根据M2序列的谱峰特征匹配预设的鼾声参数,输出监测鼾声信号;以及a click recognition module, wherein the click recognition module is configured to form an audio frame based on the time domain and the sampled signal, perform a click analysis on the audio frame shift frame; and use the window function to divide the frame, and transform the M1 sequence obtained by the frame to the frequency. Domain, and summing the low-frequency energy of the frequency domain spectrum; for the low-frequency energy and value to form the N-sequence in the time domain, and pre-judging the frequency-domain N-sequence, pre-determining the qualified N-sequence to form the M2 sequence; Performing a Fourier transform on the M2 sequence to the frequency domain, and matching the preset chirp parameters according to the spectral peak characteristics of the M2 sequence, and outputting the monitoring chirp signal;
    止鼾机构,其中,所述微控制器收到所述监测鼾声信号后,启动止鼾机构触压受监测的用户。The stopping mechanism, wherein the microcontroller initiates the stop mechanism to touch the monitored user after receiving the monitoring click signal.
  7. 根据权利要求6所述的止鼾装置,其中,所述鼾声识别模块还用于对音频帧以1/X速率先进先出移帧(其中,X为任意数);所述鼾声识别模块还用于对音频信号预加重处理。The anti-snoring device according to claim 6, wherein the click recognition module is further configured to move the frame at a 1/X rate on the audio frame (where X is an arbitrary number); the click recognition module further uses Pre-emphasis processing of the audio signal.
  8. 根据权利要求7所述的止鼾装置,其中,所述鼾声识别模块还用于在鼾声预判时:The anti-snoring device according to claim 7, wherein the click recognition module is further configured to:
    计算所述N序列数据的最大最小值以及最大差值; Calculating a maximum minimum value and a maximum difference value of the N sequence data;
    其中,所述鼾声预判的参数包括过零率、波形幅值以及过零间隙。The parameters predicted by the click include a zero-crossing rate, a waveform amplitude, and a zero-crossing gap.
  9. 根据权利要求6-8任一项所述的止鼾装置,其中,所述鼾声识别模块还用于对所述M2序列进行归一化处理、截短平均处理。The anti-snoring device according to any one of claims 6-8, wherein the click recognition module is further configured to perform normalization processing and truncation averaging processing on the M2 sequence.
  10. 根据权利要求9所述的止鼾装置,其中,The anti-snoring device according to claim 9, wherein
    所述窗函数为汉宁窗函数(Hanning)或者汉明窗函数(Hamming)或者布莱克曼窗函数(Blackman)。The window function is a Hanning window function (Haning) or a Hamming window function (Hamming) or a Blackman window function (Blackman).
  11. 一种电子设备,其中,包括:An electronic device, comprising:
    至少一个微控制器;以及,At least one microcontroller; and,
    与所述至少一个微控制器通信连接的存储器;其中,a memory communicatively coupled to the at least one microcontroller; wherein
    所述存储器存储有可被所述至少一个微控制器执行的指令,所述指令被所述至少一个微控制器执行,以使所述至少一个微控制器能够执行权利要求1-5任一项所述的方法。The memory stores instructions executable by the at least one microcontroller, the instructions being executed by the at least one microcontroller to enable the at least one microcontroller to perform any of claims 1-5 Said method.
  12. 一种非易失性计算机可读存储介质,其中,所述计算机可读存储介质存储有计算机可执行指令,所述计算机可执行指令用于使计算机执行权利要求1-5任一项所述的方法。A non-transitory computer readable storage medium, wherein the computer readable storage medium stores computer executable instructions for causing a computer to perform the method of any of claims 1-5 method.
  13. 一种计算机程序产品,其中,所述计算机程序产品包括存储在非易失性计算机可读存储介质上的计算机程序,所述计算机程序包括程序指令,当所述程序指令被计算机执行时,使所述计算机执行权利要求1-5任一项所述的方法。A computer program product, comprising: a computer program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions, when the program instructions are executed by a computer, The computer performs the method of any of claims 1-5.
PCT/CN2017/118950 2017-12-27 2017-12-27 Snore recognition method and snore-stopping device WO2019127088A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201780009009.0A CN108697328B (en) 2017-12-27 2017-12-27 Snore identification method and snore stopping device
PCT/CN2017/118950 WO2019127088A1 (en) 2017-12-27 2017-12-27 Snore recognition method and snore-stopping device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/118950 WO2019127088A1 (en) 2017-12-27 2017-12-27 Snore recognition method and snore-stopping device

Publications (1)

Publication Number Publication Date
WO2019127088A1 true WO2019127088A1 (en) 2019-07-04

Family

ID=63844124

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/118950 WO2019127088A1 (en) 2017-12-27 2017-12-27 Snore recognition method and snore-stopping device

Country Status (2)

Country Link
CN (1) CN108697328B (en)
WO (1) WO2019127088A1 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109512571B (en) * 2018-11-09 2021-08-27 京东方科技集团股份有限公司 Snore stopping device and method and computer readable storage medium
CN110182155B (en) * 2019-05-14 2021-04-30 中国第一汽车股份有限公司 Voice control method of vehicle-mounted control system, vehicle-mounted control system and vehicle
CN111128202B (en) * 2019-12-10 2022-09-30 龙马智芯(珠海横琴)科技有限公司 Sound processing method and device
CN111345782B (en) * 2020-03-12 2023-10-10 深圳数联天下智能科技有限公司 Snore type identification method and electronic equipment
CN114027801B (en) * 2021-12-17 2022-09-09 广东工业大学 Method and system for recognizing sleep snore and restraining snore
CN115206329B (en) * 2022-09-15 2023-01-24 深圳市倍轻松科技股份有限公司 Method, device, electronic equipment and storage medium for determining snore signals

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140107493A1 (en) * 2012-06-22 2014-04-17 Fitbit, Inc. Portable Biometric Monitoring Devices and Methods of Operating Same
CN105592777A (en) * 2013-07-08 2016-05-18 瑞思迈传感器技术有限公司 Method and system for sleep management
CN106137130A (en) * 2016-06-28 2016-11-23 南京大学 A kind of sleep period recognition methods based on Audio Processing
CN106264839A (en) * 2016-08-05 2017-01-04 南通海联助眠科技产品有限公司 Intelligent snore stopping pillow
CN106821337A (en) * 2017-04-13 2017-06-13 南京理工大学 A kind of sound of snoring source title method for having a supervision

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000000214A (en) * 1998-06-15 2000-01-07 Arata Nemoto Sleeping monitor device
US20070118054A1 (en) * 2005-11-01 2007-05-24 Earlysense Ltd. Methods and systems for monitoring patients for clinical episodes
US20110092839A1 (en) * 2008-11-17 2011-04-21 Toronto Rehabilitation Institute Mask and method for use in respiratory monitoring and diagnostics
CN201840571U (en) * 2010-04-16 2011-05-25 金同(广州)医疗保健品有限公司 Snore tester
US20130030257A1 (en) * 2010-05-14 2013-01-31 Kai Medical, Inc. Systems and methods for non-contact multiparameter vital signs monitoring, apnea therapy, apnea diagnosis, and snore therapy
EP2629661B1 (en) * 2010-10-20 2016-12-14 Koninklijke Philips N.V. Apparatus for diagnosing obstructive sleep apnea with an awake patient
CN102429662B (en) * 2011-11-10 2014-04-09 大连理工大学 Screening system for sleep apnea syndrome in family environment
CN103190911B (en) * 2013-04-16 2014-09-17 南京理工大学 Upper airway change monitoring method based on snore formants and power ratio tracks
CN203539513U (en) * 2013-09-27 2014-04-16 张红英 Snore reminding pillow
CN104739412B (en) * 2013-12-29 2017-11-03 中国移动通信集团公司 A kind of method and apparatus being monitored to sleep apnea
US20170035328A1 (en) * 2015-08-07 2017-02-09 Fitbit, Inc. User identification via data collected from sensors of a wearable fitness monitor
CN105232208A (en) * 2015-09-16 2016-01-13 朱王勇 Smart snore-ceasing back mat based on postural therapy
CN105249927B (en) * 2015-10-30 2019-01-11 厦门兔顽智能科技有限公司 A kind of sound of snoring recognition methods and device for preventing snoring
CN106691382B (en) * 2016-12-26 2020-12-15 赛博龙科技(北京)有限公司 Snore detection method and device based on time-frequency similarity
CN106725488B (en) * 2016-12-27 2023-08-18 深圳大学 Wireless field intensity respiration detection method and device and respiration detector

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140107493A1 (en) * 2012-06-22 2014-04-17 Fitbit, Inc. Portable Biometric Monitoring Devices and Methods of Operating Same
CN105592777A (en) * 2013-07-08 2016-05-18 瑞思迈传感器技术有限公司 Method and system for sleep management
CN106137130A (en) * 2016-06-28 2016-11-23 南京大学 A kind of sleep period recognition methods based on Audio Processing
CN106264839A (en) * 2016-08-05 2017-01-04 南通海联助眠科技产品有限公司 Intelligent snore stopping pillow
CN106821337A (en) * 2017-04-13 2017-06-13 南京理工大学 A kind of sound of snoring source title method for having a supervision

Also Published As

Publication number Publication date
CN108697328A (en) 2018-10-23
CN108697328B (en) 2021-07-13

Similar Documents

Publication Publication Date Title
WO2019127088A1 (en) Snore recognition method and snore-stopping device
US20220007965A1 (en) Methods and apparatus for detection of disordered breathing
Chauhan et al. BreathPrint: Breathing acoustics-based user authentication
Ren et al. Fine-grained sleep monitoring: Hearing your breathing with smartphones
CN107910020B (en) Snore detection method, device, equipment and storage medium
WO2021021388A1 (en) Remote health monitoring systems and methods
Almazaydeh et al. Apnea detection based on respiratory signal classification
CN107898432B (en) Snore detection method, device, equipment and medium
Abeyratne et al. Obstructive sleep apnea screening by integrating snore feature classes
US20230190140A1 (en) Methods and apparatus for detection and monitoring of health parameters
EP2451359B1 (en) Noise reduction of breathing signals
Qian et al. Automatic detection, segmentation and classification of snore related signals from overnight audio recording
WO2020238954A1 (en) Apnea monitoring method and device
Zhang et al. A novel wheeze detection method for wearable monitoring systems
CN110013222A (en) A kind of system for sleep apnea detection
Doukas et al. Automated sleep breath disorders detection utilizing patient sound analysis
CN110942784A (en) Snore classification system based on support vector machine
EP3866687A1 (en) A method and apparatus for diagnosis of maladies from patient sounds
Kang et al. Snoring and apnea detection based on hybrid neural networks
US20220346705A1 (en) Apparatus and method for sleep-disordered breathing estimation based on sound analysis
Luo et al. Design of embedded real-time system for snoring and OSA detection based on machine learning
Castillo et al. Characterization of microphones for snoring and breathing events analysis in mHealth
CN107811610B (en) Respiration rate detection method and device, electronic equipment and storage medium
JP5765338B2 (en) Voice processing apparatus and method of operating voice processing apparatus
Çavuşoğlu et al. Acoustics of snoring and automatic snore sound detection in children

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17935970

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17935970

Country of ref document: EP

Kind code of ref document: A1